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The invention relates generally to keyboard systems, 
and more specifically to reduced keyboards such as 
those implemented on a touch-sensitive panel or 
display screen and to mechanical keyboard systems, 
using word-level analysis to resolve inaccuracies in 
user entries. This is an example of where text might 
appear in the text output region of the system. As 
the user enters a sequence of keystrokes in the 
auto-correcting keyboard region, Ests of words 
corresponding to each entered sequence are 
displayed in the word choice list If the desired 
word appears as the default choice, the user can 
select it by simply typing a space and continuing to 
enter the next word- The user may also directly 
select any desired word from the Hst. Once 
any word has been selected, the word is output j 
the cursor position in the |_ 
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(57) Abstract: An enhanced 
text entry system using 
word-level analysis to auto- 
matically correct inaccuracies 
in user keystroke entries 
on reduced keyboards. The 
keyboard (105) may be a part 
of a touch-sensitive panel or 
display screen (100) or on a 
mechanical keyboard system. 
A method and system are 
defined which determine one 
or more alternative textual 
interpretations of each sequence 
of inputs detected within a 
designated auto-correcting 
keyboard region (106). The 
actual contact locations for the 
keystrokes may occur outside 
the boundaries of the specific 
keyboard key regions, where 
the distance from each contact 
location to each corresponding 
intended character may in 
general increase with the 
expected frequency of the 
intended word in the language 
or in a particular context. 
The user can easily select the 
intended word from among the 
generated interpretations. 
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KEYBOARD SYSTEM WITH AUTOMATIC CORRECTION 

Technical Field of the Invention 

The invention provides keyboard systems that auto-correct "sloppy" text input due to 
5 errors in touching a keyboard or screen. More specifically, the invention provides a reduced 
keyboard such as those implemented on a touch-sensitive panel or display screen and to 
mechanical keyboard systems, using word-level analysis to resolve inaccuracies (sloppy text 
entry) in user entries. This invention is in the field of keypad input preferably for small 
electronic devices. 

1 0 Background of the Invention 

For many years, portable computers have been getting smaller and smaller. The 
principal size-limiting component in the effort to produce a smaller portable computer has been 
the keyboard. If standard typewriter-size keys are used, the portable computer must be at least 
as large as the keyboard. Miniature keyboards have been used on portable computers, but the 

1 5 miniature keyboard keys have been found to be too small to be easily or quickly manipulated 
with sufficient accuracy by a user. 

Incorporating a full-size keyboard in a portable computer also hinders true portable use 
of the computer. Most portable computers cannot be operated without placing the computer on 
a flat work surface to allow the user to type with both hands. A user cannot easily use a 

20 portable computer while standing or moving. In the latest generation of small portable 

computers, called Personal Digital Assistants (PDAs), companies have attempted to address 
this problem by incorporating handwriting recognition software in the PDA. A user may 
directly enter text by writing on a touch-sensitive panel or display screen. This handwritten 
text is then converted into digital data by the recognition software. Unfortunately, in addition 

25 to the fact that printing or writing with a pen is in general slower than typing, the accuracy and 
speed of the handwriting recognition software has to date been less than satisfactory. To make 
matters worse, today's handheld computing devices which require text input are becoming 
smaller still. Recent advances in two-way paging, cellular telephones, and other portable 
wireless technologies has led to a demand for small and portable two-way messaging systems, 

30 and especially for systems which can both send and receive electronic mail ("e-mail"). 

It would therefore be advantageous to develop a much smaller keyboard for entry of 
text into a computer. As the size of the keyboard is reduced, the user encounters greater 
difficulty selecting the character of interest. In general there are two different types of 
keyboards utilized in such portable devices. One is the familiar mechanical keyboard 

35 consisting of a set of mechanical keys that are activated by depressing them with a finger or 
thumb. However, these mechanical keyboards tend to be significantly smaller than the 
standard sized keyboards associated with typewriters, desktop computers, and even "laptop" 
computers. As a result of the smaller physical size of the keyboard, each key is smaller and in 
closer proximity to neighboring keys. This increases the likelihood that the user will depress 
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an unintended key, and the likelihood of keystroke errors tends to increase the faster the user 
attempts to type. 

Another commonly used type of keyboard consists of a touch-sensitive panel on which 
some type of keyboard overlay has been printed, or a touch-sensitive display screen on which a 
5 keyboard overlay can be displayed. Depending on the size and nature of the specific keyboard, 
either a finger or a stylus can be used to contact the panel or display screen within the area 
associated with the key that the user intends to activate. Due to the reduced size of many 
portable devices, a stylus is often used in order to attain sufficient accuracy in contacting the 
keyboard to activate each intended key. Here again, the small overall size of such keyboards 

10 results in a small area being associated with each key so that it becomes quite difficult for the 
average user to type quickly with sufficient accuracy. 

One area of prior development in mechanical keyboards has considered the use of keys 
that are much smaller than those found on common keyboards. With smaller keys, the user 
must take great care in controlling each key press. One approach (U.S. Patent 5,612,690) 

15 proposes a system that uses up to four miniature keys in unison to define primary characters 
(the alphabet) and nests secondary character rows (like numbers) between primary character 
rows. Selecting a secondary character involves depressing the miniature key from each of the 
surrounding primary characters. Grouping the smaller keys in this fashion creates a larger 
apparent virtual key composed of four adjacent smaller keys, such that the virtual key is large 

20 enough to be depressed using a finger. However, the finger must contact the keys more or less 
precisely on the "cross-hairs" of the boundaries between the four adjacent keys in order to 
depress them in unison. This makes it still difficult to type quickly with sufficient accuracy. 

Another area of prior development in both touch screen and mechanical keyboards has 
considered the use of a much smaller quantity of full-size keys. With fewer keys, each single 

25 key press must be associated with a plurality of letters, such that each key activation is 

ambiguous as to which letter is intended. As suggested by the keypad layout of a touch-tone 
telephone, many of the reduced keyboards have used a 3-by-4 array of keys, where each key is 
associated with three or four characters (U.S. Patent 5,818,437). Several approaches have been 
suggested for resolving the ambiguity of a keystroke sequence on such a keyboard. While this 

30 approach has merit for such keyboards with a limited number of keys, it is not applicable to 
reduced size keyboards with a full complement of keys. 

Another approach in touch screen keyboards has considered analyzing the immediately 
preceding few characters in order to determine which character should be generated for a 
keystroke that is not close to the center of the display location of a particular character (U.S. 

35 Patent 5,748,512). When the keyboard is displayed on a small touch screen, keystrokes that 
are off-center from a character are detected. Software compares the possible text strings of 
probable sequences of two or three typed characters against known combinations, such as a 
history of previously typed text or a lexicon of text strings rated for their frequency within a 
context. When the character generated by the system is not the character intended by the user, 
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the user must correct the character before going on to select the a following character, because 
the generated character will be used to determine probabilities for the following keystroke. 

The fundamental problem is that the specific activations that result from a user's 
attempts to activate the keys of a keyboard do not always precisely conform to the intentions of 
5 the user. On a touch screen keyboard, the user's finger or stylus may hit the wrong character 
or hit between keys in a boundary area not associated with a specific character. With a 
miniaturized mechanical keyboard, a given keypress may activate the wrong key, or may 
activate two or more keys either simultaneously or with a "roll-over" motion that activates 
adjacent keys in a rapid sequence. Other examples include common keyboards operated by 

10 users with limited ranges of motion or motor control, where there is a limited ability to 

consistently strike any particular space or key, or where the limb (such as in the case of an 
amputee, or the use of gloved hands or gloved fingers) or the device used to make the entry 
(such as a stylus) is far larger than the targeted key or character space. 
Summary of the Invention 

15 The present invention provides an enhanced text entry system that uses word-level 

disambiguation to automatically correct inaccuracies in user keystroke entries. Specifically, 
the present invention provides a text entry system comprising: 

(a) a user input device comprising a touch sensitive surface including an auto- 
correcting keyboard region comprising a plurality of the characters of an alphabet, wherein 

20 each of the plurality of characters corresponds to a location with known coordinates in the 

auto-correcting keyboard region, wherein each time a user contacts the user input device within 
the auto-correcting keyboard region, a location associated with the user contact is determined 
and the determined contact location is added to a current input sequence of contact locations; 

(b) a memory containing a plurality of objects, wherein each object is a string of 
25 one or a plurality of characters forming a word or a part of a word, wherein each object is 

further associated with a frequency of use; 

(c) an output device with a text display area; and 

(d) a processor coupled to the user input device, memory, and output device, said 
processor comprising: 

30 (i) a distance value calculation component which, for each determined 

contact location in the input sequence of contacts, calculates a set of distance values between 
the contact locations and the known coordinate locations corresponding to one or a plurality of 
characters within the auto-correcting keyboard region; 

(ii) a word evaluation component which, for each generated input sequence, 

35 identifies one or a plurality of candidate objects in memory, and for each of the one or a 
plurality of identified candidate objects, evaluates each identified candidate object by 
calculating a matching metric based on the calculated distance values and the frequency of use 
associated with the object, and ranks the evaluated candidate objects based on the calculated 
matching metric values; and 
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(iii) a selection component for (a) identifying one or a plurality of candidate 
objects according to their evaluated ranking, (b) presenting the identified objects to the user, 
enabling the user to select one of the presented objects for output to the text display area on the 
output device. 

5 Preferably, the selection component further comprises (c) resetting the current input 

sequence of contact locations to an empty sequence upon detecting the selection by the user of 
one of the presented objects for output to the text display area on the output device. 

Preferably, (a) each of the plurality of objects in memory is further associated with one 
or a plurality of predefined groupings of objects; and (b) the word evaluation component, for 

10 each generated input sequence, limits the number of objects for which a matching metric is 
calculated by identifying one or a plurality of candidate groupings of the objects in memory, 
and for one or a plurality of objects associated with each of the one or a plurality of identified 
candidate groupings of objects, calculates a matching metric based on the calculated distance 
values and the frequency of use associated with each candidate object, and ranks the evaluated 

15 candidate objects based on the calculated matching metric values. This reduces the calculation 
required since, conversely, one or more groupings of objects are identified as containing no 
candidate objects for a given input sequence of contacts, such that a matching metric need not 
be calculated for any object in the groupings so identified. 

Preferably, the characters of the alphabet are arranged on the auto-correcting keyboard 

20 region in approximately a standard "QWERTY" layout. Most preferably, the width to height 
ratio of the auto-correcting keyboard region is approximately 2 to 1, or the width to height ratio 
of the auto-correcting keyboard region is less than 2 to 1 . In one embodiment, one or a 
plurality of the characters arranged on the auto-correcting keyboard region are illegible. 

Preferably, the auto-correcting keyboard region includes one or a plurality of known 

25 locations associated with one or a plurality of punctuation characters, wherein the memory 
includes one or a plurality of objects in memory which include one or a plurality of the 
punctuation characters associated with locations in the auto-correcting keyboard region. 
Preferably, the objects in memory are further associated with one or a plurality of modules, 
wherein each module comprises a set of objects with one or a plurality of common 

30 characteristics. In one embodiment, the text entry system comprises a module selector whereby 
a user can determine which modules are to be evaluated by the word evaluation component in 
order to identify candidate objects. In another embodiment, the plurality of modules comprises 
word stem modules and suffix modules, wherein each word stem module comprises a logical 
organization of uninfected word stem objects, and wherein each suffix module comprises a 

35 logical organization of suffixes which can be appended to word stems to form inflected words, 
whereby each word stem module is associated with one or a plurality of suffix modules, 
whereby whenever the word evaluation component calculates a matching metric value for a 
given word stem in a given word stem module with respect to an initial sequence of contacts 
within an input sequence such that the calculated matching metric value ranks higher than a 
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predetermined threshold, the word evaluation component evaluates the remaining contacts of 
the input sequence with respect to the associated suffix modules, whereby whenever the word 
evaluation component calculates a matching metric value for a given suffix in one of said 
associated suffix modules that ranks higher than a second predetermined threshold, said suffix 
5 is appended to said word stem to form a completed word corresponding to a matching metric 
value that is a function of said determined word stem matching metric value and said 
determined suffix matching metric value. 

Preferably, the word evaluation component calculates the matching metric for each 
candidate object by summing the distance values calculated from each contact location in the 

10 input sequence to the location assigned to the character in the corresponding position of the 
candidate object, and applying a weighting function according to the frequency of use 
associated with the object. In addition, each character of the alphabet associated with the auto- 
correcting keyboard region is assigned a Cartesian coordinate and wherein the distance value 
calculation component calculates the distance between the contact location and the location 

1 5 corresponding to a character according to standard Cartesian coordinate distance analysis. 

Further, each character of the alphabet associated with the auto-correcting keyboard region is 
assigned a Cartesian coordinate and wherein the distance value calculation component 
calculates the distance between the contact location and the location corresponding to a 
character as the square of the standard Cartesian coordinate distance. The distance values are 

20 placed in a table. In addition, each location on the auto-correcting keyboard region is defined 
by a horizontal and a vertical coordinate, and wherein the distance value between a contact 
location and the known coordinate location corresponding to a character comprises a horizontal 
and a vertical component, wherein the vertical component is adjusted by a weighting factor in 
calculating the distance of the contact location from the character. The word evaluation 

25 component adds an increment value to the sum of the distance values prior to applying a 
weighting function according to the frequency of use associated with the candidate object. 
Most preferably, the increment value is a fixed value that is approximately twice the average 
distance between adjacent locations on the auto-correcting keyboard region corresponding to 
characters. The frequency of use associated with each candidate object in memory comprises 

30 the ordinal ranking of the object with respect to other objects in memory, wherein an object 

associated with a higher relative frequency corresponds to a numerically lower ordinal ranking. 
Most preferably, the frequency weighting function applied by the word evaluation component 
to the summed distance values for a candidate object comprises multiplying the sum of the 
distance values by the base 2 logarithm of the ordinal ranking of the object. 

35 Preferably, objects in memory are stored such that the objects are classified into 

groupings comprising objects of the same length. The word evaluation component limits the 
number of objects for which a matching metric is calculated by initially identifying candidate 
groupings of objects of the same length as the number of inputs in the input sequence. Most 
preferably, if fewer than a threshold number of candidate objects are evaluated to have a 
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matching metric score better than a threshold value, the word evaluation component identifies 
candidate groupings of objects of progressively longer lengths and calculates the matching 
metric for the objects in the identified groupings until said threshold number of candidate 
objects are evaluated to have a matching metric score better than said threshold. Further, the 
5 word evaluation component calculates the matching metric for each candidate object by 

summing the distance values calculated from each contact location in the input sequence to the 
location assigned to the character in the corresponding position of the candidate object and 
adding an increment value, and applying to this sum a weighting function according to the 
frequency of use associated with the object, and wherein the increment value added to the sum 

10 of the distance values is a value that is based on the difference between the number of 

characters in the candidate object and the number of inputs in the current input sequence. 

Preferably, the word evaluation component calculates the matching metric for each 
candidate object by summing the distance values calculated from each contact location in the 
input sequence to the location assigned to the character in the corresponding position of the 

15 candidate object, and applying a weighting function according to the frequency of use 
associated with the object. Most preferably, the frequency of use associated with each 
candidate object in memory comprises the ordinal ranking of the object with respect to other 
objects in one or a plurality of sub-groupings in memory with which said object is associated, 
wherein an object associated with a higher relative frequency corresponds to a numerically 

20 lower ordinal ranking. In addition, for each calculated distance value between a contact 

location in the input sequence and the known coordinate location corresponding to a character 
within the auto-correcting keyboard region wherein said calculated distance exceeds a 
threshold distance value, for each object in memory in which said character occurs at a position 
in the sequence of the characters of said object corresponding to the position of said contact 

25 location in said input sequence, said object is ranked by the word evaluation component as an 
object that is excluded from presentation to the user for selection. One or a plurality of the 
identified candidate groupings of the objects in memory comprise objects that are excluded 
from presentation to the user for selection, wherein at least one of the calculated distance 
values included in the calculated sum of distance values for each object in said one or 

30 identified candidate groupings of objects exceeds a threshold distance value. The auto- 
correcting keyboard region is separated into two or more predefined clustering regions, each of 
which contains the known locations of one or a plurality of characters, and wherein each 
objects in memory is assigned to a predefined group according to which of said two or more 
predefined clustering regions contain the known locations corresponding to one or a plurality 

35 of the initial characters of said object. In one embodiment, the auto-correcting keyboard region 
is separated into three predefined clustering regions, and wherein each object in memory is 
assigned to one of nine predefined groupings based which of the three predefined clustering 
regions contain the known locations corresponding to each of the first two characters of said 
object. 
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Preferably, for each character corresponding to a known location in the auto-correcting 
keyboard region, a region is predefined around one or a plurality of said known locations 
wherein the distance between an input contact location falling within said predefined region 
and the known character location within said predefined region is calculated as a distance of 
5 zero. Most preferably, the relative sizes of said predefined regions correspond to the relative 
frequencies of occurrence of the characters associated with the known locations within said 
predefined regions. The predefined region around the known location of a character 
corresponds to a displayed key on the touch screen. Further, at least one of the locations with 
known coordinates in the auto-correcting keyboard region corresponds to a plurality of 

10 characters, one or a plurality of which include various diacritic marks, wherein the plurality of 
characters comprise variant forms of a single base character, and wherein objects in memory 
are stored with their correct accented characters. 

Preferably, the selection component presents the identified one or a plurality of 
candidate objects for selection by the user in a candidate object list in the text display area. 

1 5 Most preferably, the selection component identifies the highest ranked candidate object and 
presents the identified object in the candidate object list in the position nearest to the auto- 
correcting keyboard region. In addition, user selection of a character that is associated with a 
contact outside of the auto-correcting keyboard region accepts and outputs the determined 
highest ranked candidate object at a text insertion point in the text display area prior to 

20 outputting the selected character at the text insertion point in the text display area. The user 
selection of an object for output at a text insertion point in the text display area terminates the 
current input sequence such that the next contact within the auto-correcting keyboard region 
starts a new input sequence. In addition, the selection component detects a distinctive manner 
of selection that is used to select a candidate object, and wherein upon detecting that an object 

25 has been selected through said distinctive manner, the system replaces the current input 

sequence of actual contact locations with an input sequence of contact locations corresponding 
to the coordinate locations of the characters comprising the selected object, and wherein a next 
contact in the auto-correcting keyboard region is appended to the current input sequence. 

Preferably, the word evaluation component determines, for each determined contact 

30 location in each input sequence of contact locations, the closest known location corresponding 
to a character, and constructs an exact typing object composed of said determined 
corresponding characters in the order corresponding to the input sequence of contact locations. 
Most preferably, for each input sequence of contact locations, the selection component presents 
said exact typing object to the user for selection. Further, when the user selects said exact 

35 typing object for output to the text display area on the output device and said exact typing 
object is not already included as one of the objects in memory, said exact typing object is 
added to the memory. Prior to displaying the exact typing object to the user for selection, the 
selection component compares the exact typing object to a database of offensive objects, each 
of which is associated with an acceptable alternative object for display, and if a match is found, 
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replaces the exact typing object with the associated acceptable object for presentation to the 
user. 

Preferably, the selection component identifies the highest ranked candidate object and 
presents the identified object at the text insertion point in the text display area on the output 
5 device. Most preferably, the text entry system includes a select key region associated with an 
object selection function, wherein when said select key region is contacted, the object 
presented at the text insertion point in the text display area on the output device is replaced 
with next highest ranked object of the identified one or a plurality of candidate objects. 

Preferably, the text entry system includes a delete key region associated with a delete 

1 0 function, wherein when the current input sequence includes at least one contact and said delete 
key region is contacted, the last input contact from the current input sequence of contacts is 
deleted, without terminating the current input sequence. In another preferred embodiment, the 
text entry system includes an Edit Word key region associated with an Edit Word function, 
wherein when no current input sequence exists and said Edit Word key region is contacted: 

15 (i) when the text insertion point in the text display area on the output device is 

contained within a previously output word, the system establishes a new current input sequence 
consisting of a sequence of contact locations corresponding to the coordinate locations 
associated with the characters of said word, and 

(ii) when the text insertion point in the text display area on the output device is located 

20 between two previously output words, the system establishes a new current input sequence 
consisting of a sequence of contact locations corresponding to the coordinate locations 
associated with the characters of the word adjacent to the text insertion point, and 

wherein the text entry system processes said new current input sequence and 
determines a corresponding ranking of new candidate objects, and wherein selection of one of 

25 the new candidate objects replaces the previously output word used to establish said new 
current input sequence. 

Preferably, as the user enters an input sequence by performing a sequence of contact 
actions within the auto-correcting keyboard region, the processor determines the location 
associated with each user contact action by recording each contact action in the sequence as an 

30 indexed primary set of a fixed number of two or more regularly spaced contact points along the 
path traced out by the user contact action, and by assembling two or more corresponding 
secondary sets of contact points by taking, for each of the two or more possible primary index 
values, the sequence of contact points having the same index value, one from each recorded 
indexed primary set of contact points, and by determining with respect to each word is selected 

35 by the user for output, a minimizing primary index value that identifies the assembled 
secondary set of contact points for which the calculated distance between the assembled 
secondary set of contact points and the known locations corresponding to the characters of the 
selected word is minimized, and whereby for a next input sequence of user contact actions, the 
distance value calculation component calculates distance values based on a sequence of contact 

8 
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locations determined as the secondary set of contact point locations assembled from said next 
input sequence of contact actions corresponding to the determined minimizing primary index 
value. Most preferably, for a plurality of user input sequences, the distance value calculation 
component computes a running average of the distance calculations for each of the two or 
5 more assembled secondary sets corresponding to the two or more primary index values, and 
whereby for a next input sequence of contact actions, the distance value calculation component 
calculates distance values based on a sequence of contact locations determined as the 
secondary set of contact point locations assembled from said next input sequence of contact 
actions corresponding to the minimizing primary index value determined with respect to said 

10 computed running averages. Further, for each primary index value, the distance value 

calculation component computes a running average of the horizontal and vertical components 
of the offset of the coordinate location corresponding to each character of each selected word 
with respect to the coordinate location of each corresponding recorded indexed contact point, 
and wherein in performing distance calculations for the word evaluation component, the 

1 5 distance value calculation component adjusts the horizontal and vertical coordinates of each 
recorded indexed contact point by an amount that is a function of the average horizontal and 
vertical offsets computed with respect to the corresponding primary index value. 

Preferably, for each input contact location, the distance value calculation component 
computes a running average of the horizontal and vertical components of the offset of the 

20 coordinate location corresponding to each character of each selected word with respect to the 
coordinates of each corresponding input contact location, and wherein in performing distance 
calculations for the word evaluation component, the distance value calculation component 
adjusts the horizontal and vertical coordinates of each input contact location by amounts that 
are functions of the computed average signed horizontal and vertical offsets. Alternatively, the 

25 processor further comprises a stroke recognition component that determines for each user 
contact action within the auto-correcting keyboard region whether the point of contact is 
moved less than a threshold distance from the initial contact location prior to being lifted from 
the touch sensitive surface, whereby: 

(a) when the point of contact is moved less than a threshold distance from the initial 

30 contact location prior to being lifted from the touch sensitive surface, the stroke recognition 
component determines that the user contact is a tap contact, and the location determined to be 
associated with the user contact is added to the current input sequence of contact locations to 
be processed by the distance value calculation component, the word evaluation component, and 
the selection component, and 

35 b) when the point of contact is moved greater than or equal to a threshold distance 

from the initial contact location prior to being lifted from the touch sensitive surface, the stroke 
recognition component determines that the user contact is one of a plurality of stroke contacts 
that are associated with known system functions, and classifies the stroke contact as one of the 
plurality of predefined types of stroke contacts. 
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Preferably, when a threshold number of contact locations in the input sequence are 
further than a threshold maximum distance from the corresponding character in the sequence 
of characters comprising a given candidate object, said object is identified as no longer being a 
candidate object for the selection component. Alternatively, the processor further comprises a 
5 frequency promotion component for adjusting the frequency of use associated with each object 
in memory as a function of the number of times the object is selected by the user for output to 
the text display area on the output device. Moreover, the frequency of use associated with each 
object in memory comprises the ordinal ranking of the object with respect to other objects in 
memory, wherein an object associated with a higher relative frequency corresponds to a 

10 numerically lower ordinal ranking, and wherein when an object is selected for output by the 
user, the frequency promotion component adjusts the ordinal ranking associated with said 
selected object by an amount that is a function of the ordinal ranking of said object prior to said 
adjustment. Further, the function used by the frequency promotion component to determine 
the amount by which the ordinal ranking associated with a selected object is adjusted reduces 

15 said amount for objects with ordinal rankings that are associated with relatively higher 

frequencies of use. The frequency promotion component analyzes additional information files 
that are accessible to the text entry system to identify new objects contained in said files that 
are not included among the objects already in said memory of said text entry system, and 
wherein said newly identified objects are added to the objects in memory as objects that are 

20 associated with a low frequency of use. Further, the frequency of use associated with a newly 
identified object that is added to the objects in memory is adjusted by the frequency promotion 
component as a function of the number of times that the newly identified object is detected 
during the analysis of said additional information files. 

Preferably, the processor further comprises a frequency promotion component for 

25 adjusting the frequency of use associated with each object in memory as a function of the 
number of times the object is selected by the user for output to the text display area on the 
output device with respect to other objects associated with the same predefined grouping. 
Most preferably, when an object is selected by the user for output to the text display area on 
the output device, the frequency promotion component increases the value of the frequency 

30 associated with the selected object by a relatively large increment, and decreases by a 

relatively small decrement the frequency associated with unselected objects that are associated 
with the same grouping as the selected object. Alternatively, information regarding the 
capitalization of one or a plurality of objects is stored along with the objects in memory and 
wherein the selection component presents each identified object in a preferred form of 

35 capitalization according to the stored capitalization information. In another embodiment, one 
or a plurality of objects in memory are associated with a secondary object in memory 
comprising a sequence of one or a plurality of letters or symbols, and wherein when the 
selection component identifies one of said objects for presentation to the user based on the 
matching metric calculated by the word evaluation component, the selection component 
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presents the associated secondary object for selection. 

The present invention further provides a text entry system comprising: 

(a) a user input device comprising a keyboard constructed with mechanical keys 

including an auto-correcting keyboard region comprising a plurality of keys, each 
5 corresponding to a character of an alphabet and each at a known coordinate location, wherein 

each time a user activates one or a plurality of adjacent keys in the auto-correcting keyboard 

region within a predetermined threshold period of time to generate a key activation event, a 

determined location corresponding to the key activation event is appended to a current input 

sequence of the determined locations of the key activation events; 
10 (b) a memory containing a plurality of objects, wherein each object is a string of 

one or a plurality of characters forming a word or a part of a word, wherein each object is 

further associated with a frequency of use; 

(c) an output device with a text display area; and 

(d) a processor coupled to the user input device, memory, and output device, said 
1 5 processor comprising: 

(i) a distance value calculation component which, for each generated key 
activation event location in the input sequence of key activation events, calculates a set of 
distance values between the key activation event location and the known coordinate locations 
corresponding to one or a plurality of keys within the auto-correcting keyboard region; 

20 (ii) a word evaluation component which, for each generated input sequence, 

identifies one or a plurality of candidate objects in memory, and for each of the one or a 
plurality of identified candidate objects, evaluates each identified candidate object by 
calculating a matching metric based on the calculated distance values and the frequency of use 
associated with the object, and ranks the evaluated candidate objects based on the calculated 

25 matching metric values; and 

(iii) a selection component for identifying one or a plurality of candidate 
objects according to their evaluated ranking, presenting the identified objects to the user, and 
enabling the user to select one of the presented objects for output to the text display area on the 
output device. 

30 Preferably, (a) each of the plurality of objects in memory is further associated with one 

or a plurality of predefined groupings of objects; and (b) the word evaluation component, for 
each generated input sequence, limits the number of objects for which a matching metric is 
calculated by identifying one or a plurality of candidate groupings of the objects in memory, 
and for one or a plurality of objects associated with each of the one or a plurality of identified 

35 candidate groupings of objects, calculates a matching metric based on the calculated distance 
values and the frequency of use associated with each candidate object, and ranks the evaluated 
candidate objects based on the calculated matching metric values. Further, the keys associated 
with the characters of the alphabet are arranged in the auto-correcting keyboard region in 
approximately a standard "QWERTY" layout. 
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Preferably, when a key activation event is detected comprising the simultaneous 
activation of a plurality of adjacent keys in the auto-correcting keyboard region, a location 
corresponding to said key activation event is determined as a function of the locations of the 
simultaneously activated keys, and said determined location is appended to the current input 
5 sequence of the locations of the key activation events. Most preferably, the function used to 
determine the location of said key activation event comprises the computation of the location 
corresponding to the center of the locations of the simultaneously activated keys. Further, the 
function used to determine the location of said key activation event comprises the computation 
of the location corresponding to the weighted center of gravity of the locations of the 

10 simultaneously activated keys, wherein the weights associated with each of the keys in the 
auto-correcting keyboard region correspond to the relative frequencies of occurrence of the 
characters associated with the keys, wherein said relative frequencies are determined with 
respect to the frequencies of occurrence of the characters in the objects in memory. 

Preferably, when a key activation event is detected comprising the activation of a 

1 5 plurality of adjacent keys in the auto-correcting keyboard region within a predetermined 

threshold period of time, wherein at all times during said key activation event at least one of 
said plurality of adjacent keys is activated and wherein at any moment during said key 
activation event that any subset of said plurality of keys is simultaneously activated, said 
simultaneously activated subset of keys comprises keys that are contiguously adjacent, a 

20 location corresponding to said key activation event is determined as a function of the locations 
of the entire plurality of adjacent keys detected during said key activation event, and said 
determined location is appended to the current input sequence of the locations of the key 
activation events. Most preferably, the function used to determine the location of said key 
activation event comprises the computation of the location corresponding to the center of the 

25 locations of the simultaneously activated keys. Further, the function used to determine the 

location of said key activation event comprises the computation of the location corresponding 
to the weighted center of gravity of the locations of the simultaneously activated keys, wherein 
the weights associated with each of the keys in the auto-correcting keyboard region correspond 
to the relative frequencies of occurrence of the characters associated with the keys, wherein 

30 said relative frequencies are determined with respect to the frequencies of occurrence of the 
characters in the objects in memory. 

Preferably, the auto-correcting keyboard region includes one or a plurality of keys 
associated with one or a plurality of punctuation characters, wherein said memory includes one 
or a plurality of objects in memory which include one or a plurality of the punctuation 

35 characters associated with keys in said auto -correcting keyboard region. Alternatively, the 
word evaluation component calculates the matching metric for each candidate object by 
summing the distance values calculated from determined location in the input sequence to the 
known location of the key corresponding to the character in the corresponding position of the 
candidate object, and applying a weighting function according to the frequency of use 
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associated with the object. In another embodiment, at least one of the keys in the auto- 
correcting keyboard region corresponds to a plurality of characters, one or a plurality of which 
include various diacritic marks, wherein the plurality of characters comprise variant forms of a 
single base character, and wherein objects in memory are stored with their correct accented 
5 characters. 

Preferably, the selection component presents the identified one or a plurality of 
candidate objects for selection by the user in a candidate object list in the text display area. 
Most preferably, the selection component identifies the highest ranked candidate object and 
presents the identified object in the candidate object list in the position nearest to the auto- 

10 correcting keyboard region. Further, activation of a key that is associated with a character, 
wherein the key is not included within the auto-correcting keyboard region, accepts and 
outputs the determined highest ranked candidate object at a text insertion point in the text 
display area prior to outputting the selected character at the text insertion point in the text 
display area. Further, the user selection of an object for output at a text insertion point in the 

1 5 text display area terminates the current input sequence such that the next key activation event 
within the auto-correcting keyboard region starts a new input sequence. 

The present invention further provides a process for auto-correcting text entry system 
comprising: 

(a) providing a user input device comprising a touch sensitive surface including an 
20 auto-correcting keyboard region comprising a plurality of the characters of an alphabet, 

wherein each of the plurality of characters corresponds to a location with known coordinates in 
the auto-correcting keyboard region, wherein each time a user contacts the user input device 
within the auto-correcting keyboard region, a location associated with the user contact is 
determined and the determined contact location is added to a current input sequence of contact 
25 locations; 

(b) providing a memory containing a plurality of objects, wherein each object is a 
string of one or a plurality of characters forming a word or a part of a word, wherein each 
object is further associated with a frequency of use; 

(c) providing an output device with a text display area; and 

30 (d) providing a processor coupled to the user input device, memory, and output 

device, said processor comprising: 

(i) a distance value calculation component which, for each determined 
contact location in the input sequence of contacts, calculates a set of distance values between 
the contact locations and the known coordinate locations corresponding to one or a plurality of 

35 characters within the auto-correcting keyboard region; 

(ii) a word evaluation component which, for each generated input sequence, 
identifies one or a plurality of candidate objects in memory, and for each of the one or a 
plurality of identified candidate objects, evaluates each identified candidate object by 
calculating a matching metric based on the calculated distance values and the frequency of use 
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associated with the object, and ranks the evaluated candidate objects based on the calculated 
matching metric values; and 

(iii) a selection component for (a) identifying one or a plurality of candidate 
objects according to their evaluated ranking, (b) presenting the identified objects to the user, 
5 enabling the user to select one of the presented objects for output to the text display area on the 
output device. 

Preferably, the selection component further comprises (c) resetting the current input 
sequence of the locations of the points of contact to an empty sequence upon detecting the 
selection by the user of one of the presented objects for output to the text display area on the 
1 0 output device. 

Brief Description of the Drawings 

Figure 1 A is a schematic view of a preferred embodiment of a portable computer 
incorporating a reduced keyboard system of the present invention which automatically corrects 
input keystrokes; Figure IB is the same schematic view showing an embodiment of a word 
15 choice list that is displayed after the user has entered a sequence of keystrokes within the auto- 
correcting keyboard region. 

Figure 2 is a hardware block diagram of the reduced keyboard system of Figures 1 A 
and IB. 

Figure 3 is a schematic view of a preferred embodiment of an auto-correcting keyboard 
20 region of the reduced keyboard system of the present invention which automatically corrects 
input keystrokes, showing its division into three clustering regions and three example contact 
points. 

Figures 4A through 4K show a flow chart of a preferred embodiment of software to 
determine the intended text to be generated in response to an input sequence of keystrokes. 

25 Figures 5A through 5E are schematic views showing a sequence of character inputs as 

an illustrative example of entering a word on a preferred embodiment of a portable computer 
incorporating a reduced keyboard system of the present invention. 
Detailed Description of the Invention 

Since user keystroke entries are presumed to be possibly inaccurate, there is some 

30 ambiguity as to how a particular sequence of keystrokes should be interpreted in order to 
generate the sequence of characters that the user intended to type. The present invention 
provides a process and system (i.e., apparatus or device having the inventive software within) 
wherein the user is presented with one or more alternate interpretations of each keystroke 
sequence corresponding to a word such that the user can easily select the desired interpretation, 

35 and wherein no special action need be taken to select the interpretation deemed most likely. 

This approach enables the system to utilize the information contained in the entire sequence of 
keystrokes corresponding to a word in resolving what the user's likely intention was for each 
character of the sequence. 
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The method of the present invention has two very significant advantages over prior 
systems, such as that disclosed by U.S. Patent 5,748,512. One is that the inventive system is 
able to utilize information regarding both preceding and succeeding keystrokes in determining 
the intended character for each keystroke, together with the length of the word and a database 
5 that includes information as to the relative frequencies of potentially matching words. This is 
far more information than can be utilized by prior systems, and greatly increases the 
performance of the system. The second advantage is that the user need only interact and 
respond to predictions by the system at word boundaries, after all the characters of each word 
have been entered, rather than having to examine, and accept or reject each character generated 

10 by the system immediately following each keystroke. This greatly enhances the usability of 
the system, since the user is thus able to focus much more on the entry of text on the keyboard 
without needing to constantly divert his attention to the display following each keystroke. 
Another advantage is the system also accommodates punctuation characters such as a hyphen 
or apostrophe that are commonly embedded in words such as hyphenated compounds and 

1 5 contractions in English. Such embedded punctuation characters can be associated with one or 
more keys or character locations included among those associated with alphabetic characters. 
Definitions 

"Keyboard" shall mean include any input device having defined areas including, but 
not limited to, a touch sensitive screen having a defined area containing a plurality of defined 
20 locations associated with characters, a touch sensitive screen having defined areas for keys, 
discrete mechanical keys, or membrane keys. 

"Auto-correcting keyboard region" refers to that region of the keyboard having the 
inventive auto-correcting process and features applied. 

"Object" shall mean a string of one or more characters forming a word or part of a 
25 word, including without limitation, wordstems, prefixes and suffixes. 

"Word" shall mean a string of one or more characters that is used in a given language, 
also including without limitation acronyms, abbreviations and proper nouns. 

"Wordstem" shall mean a "root word" or "root stem." For example, the word 
"interestingly" consists of the root word "interest" to which the suffix "ingly" has been 
30 appended. As a further example, the word "surprisingly" consists of the root stem "surpris" to 
which the suffix "ingly" has been appended. 

"Module" is a logical organization of objects based upon characteristics of the objects. 
For example, (1) words of the French language versus words of the English language are 
arranged in different modules, (2) a verb stem module contains verb stems to each of which 
35 one or more possible suffixes may be appended, wherein the suffixes are contained in one or 
more suffix modules associated with the verb stem module, wherein the suffixes from the 
suffix modules can be appended to a verb stem in the verb stem module to form a properly 
inflected word. 
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A "contact action" comprises the entirety of a user action which results in a contact 
with the keyboard, starting from the first point and moment of contact, and including any 
additional contiguous points of contact that are detected up to the moment at which contact 
with the keyboard is terminated. Examples of contact actions include, but are not limited to, 
5 physically touching a touch sensitive screen with a stylus or finger and moving the stylus or 
finger to a greater or lesser degree prior to lifting the stylus or finger from the screen, or 
moving a mouse cursor to position it within a displayed keyboard region and clicking the 
mouse button then moving the mouse cursor to a greater or lesser degree prior to releasing the 
mouse button. 

10 A "contact location'' is the location determined to correspond to a user action which 

results in a contact with the keyboard. Methods of determining a contact location include, but 
are not limited to, detecting the point on a touch screen or similar device where the initial or 
final contact was made by the user, or detecting a user action such as a mouse click whereby 
the contact location is determined corresponding to the location of the mouse cursor within a 

15 displayed keyboard region at the time of that the mouse button is first depressed. 

A "distance value calculation component" calculates a set of distance values between a 
contact location and the known coordinate locations corresponding to one or more characters 
within the auto-correcting region of the keyboard. The method used to calculate the distance 
between two locations includes, but is not limited to, assigning Cartesian coordinates to each 

20 location and calculating the distance between two locations according to standard Cartesian 
coordinate distance analysis, assigning Cartesian coordinates to each location and calculating 
the distance between two locations as the square of the standard Cartesian coordinate distance, 
and assigning Cartesian coordinates to each location and calculating the distance between two 
locations according to Cartesian coordinate distance analysis where the vertical component is 

25 adjusted by a weighting factor. 

A "matching metric" is a score calculated for an object with respect to an input 
sequence of contact locations as a means to estimate how likely it is that the object corresponds 
to the user's intention in performing the input sequence of contacts. For example, a matching 
metric can be calculated in one embodiment as the sum of the squares of the distances from 

30 each contact location in the entry sequence to the location assigned to the character in the 
corresponding position of a given candidate object, and then multiplying the sum of the 
squared distances by a frequency adjustment factor, which in one preferred embodiment is 
calculated as the base 2 logarithm of the ordinal position of the word with respect to other 
potential candidate objects, where objects associated with higher relative frequencies 

35 correspond to lower ordinal positions {i.e., the most frequent object is at position "1"). Thus, 
in this embodiment, the lower the numeric value of the calculated matching metric, the more 
likely a given object is deemed to correspond to the user's intention in generating a sequence 
of contact points. 
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An "evaluated ranking" is the relative prioritization of a set of candidate objects 
according the likelihood that each of the objects corresponds to the user's intention in 
generating a sequence of contact points, where this likelihood is determined according to the 
matching metric calculated for the objects. 
5 A "key activation event" includes but is not limited to an event that comprises the 

entirety of the activated keys detected during the course of a user action which results in the 
activation of one or more adjacent keys of a mechanical keyboard, starting from the first 
depressed key and including the time at which it was depressed, and including any additional 
keys that are adjacent to the first depressed key and that are simultaneously depressed, detected 
10 up to the moment at which neither the first key nor any simultaneously depressed adjacent key 
is depressed. 

With regard to Figure 1A, a reduced auto-correcting keyboard system 100 formed in 
accordance with the present invention is depicted incorporated in a palmtop portable computer 
102. Portable computer 102 contains a reduced keyboard 105 implemented on a touch screen 

1 5 display 103, which is used to generate text to be output to text display region 104. For 
purposes of this application, the term "keyboard" is defined broadly to include any input 
device having defined areas including, but not limited to, a touch screen having defined areas 
for keys, discrete mechanical keys, or membrane keys. Keyboard 1 05 has an auto-correcting 
keyboard region 106 wherein the 26 letters of the English alphabet plus an apostrophe are 

20 displayed in approximately the standard "QWERTY" arrangement. In this preferred 

embodiment, it is relevant to note that the aspect ratio of the keyboard 106 (the ratio of its 
width to its height) is less than 2:1, whereas for a standard computer keyboard or typewriter, 
this ratio is approximately 4: 1 . This aspect ratio renders the keyboard 1 06 easier to use in that 
the less elongated shape tends to minimize the distance the stylus must be moved between 

25 letters at opposite ends of the keyboard, while enhancing the system's ability to distinguish 
between letters in adjacent rows by increasing their relative separation. This makes it easier 
for the user to contact the keyboard in a location that is relatively close to the intended letter in 
the vertical dimension. Consequently, in a preferred embodiment, the distance of a contact 
point from a letter is calculated using a method that increases the relative weight of the vertical 

30 component of the distance with respect to the horizontal component. 

The keyboard may be of any size, very small or very large. For example, an 
implementation using a space for the auto-correcting keyboard as small as 1 cm by 0.5 cm that 
includes all 26 letters of the English alphabet has been found to be quite usable with a small 
plastic stylus. When embodied as a keyboard of this size, a well-known key arrangement such 

35 as the standard "QWERTY" layout may be used. With this key arrangement it is not necessary 
to include legible displayed characters, because the relative locations of each character in the 
defined keyboard space are well known to users familiar with such a standard layout. 
Alternatively, a very small label such as a dot may be displayed at each character location to 
aid the user. 
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In accordance with another aspect of the invention, the internal, logical representation 
of the characters need not mirror the physical arrangement represented by the labels on the 
actual characters in the auto-correcting keyboard. For example, in a database constructed to 
represent a French vocabulary module, the accented characters A and A may also be associated 
5 with the unaccented character A that appears as an explicitly labeled key in a mechanical 

keyboard or character location in a keyboard implemented on a touch-sensitive display screen. 
The word entries in the French vocabulary module include the necessary information to 
determine whether a given word is spelled with an accented or unaccented character so that the 
correctly spelled form can be automatically generated based on an input contact point 

1 0 sufficiently near the key or character location associated with the unaccented character. This is 
a significant advantage for languages such as French that often use accented characters since 
no special typing techniques, additional keys, or additional keystrokes are required to type 
words using their correct spellings including appropriate accents. 

The preferred keyboard of Figure 1 A contains six additional keys associated with the 

15 execution of specific functions or the generation of specific characters. These keys include a 
Shift key 108, a Space key 1 10, a Backspace key 1 12, an Edit Word key 1 14, a Symbols Mode 
key 1 16, a Return (or "Enter") key 1 18, an Alternate Keyboard Mode key 120, and a Numeric 
Mode key 122. The function of these keys will be discussed in conjunction with Figure IB. 
Text is generated using the keyboard system via keystrokes on the auto-correcting 

20 keyboard 106. As a user enters a keystroke sequence using the keyboard, text is displayed on 
the computer display 103. Two overlapping regions are defined on the display each of which 
display information to the user. An upper output text region 104 displays the text entered by 
the user and serves as a buffer for text input and editing. A word choice list region 150, which 
in a preferred embodiment shown in Figure IB is super-imposed on top of the text region 104, 

25 provides a list of words and other interpretations corresponding to the keystroke sequence 

entered by a user. The word choice list region 150 aids the user in correcting inaccuracies in 
the entered keystrokes. In another embodiment, the system may be implemented on a device 
with limited display space, and display only the Default or most likely word object at the 
insertion point 107 in the text being generated. 

30 In another preferred embodiment, the keyboard of the present invention is implemented 

using a keyboard device. Examples of such devices would include the standard desktop 
keyboard used with personal computers, or much smaller mechanical keyboards such as those 
commonly used in portable, hand-held electronic devices such as two-way pagers that, in the 
case of English and other Latin-alphabet based languages, include a full set of at least 26 small, 

35 closely spaced keys, often arranged in the standard "QWERTY" layout. This embodiment is 
logically identical to a touch screen implementation in which the points at which physical 
contact with the screen can be detected correspond exactly to the set of coordinates associated 
with the characters of the keyboard. Thus, the methods of the present invention may be 
applied equally well to such mechanical keyboard implementations. 
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However, mechanical keyboards have a distinguishing characteristic from touch 
screens in that a single inaccurate or erroneous key activation may not only consist of 
activating a key other than the one intended, but also may consist of simultaneous or closely 
sequential activation of two or more adjacent keys, wherein the activated keys may or may not 
5 include among them the intended key. Thus, in accordance with another aspect of the 
invention, a sequence of keystrokes on the auto-correcting keyboard is filtered through a 
window of both time and space, in that a single intended keystroke may activate more than one 
adjacent key. An example is when a user depresses 2, 3 or 4 keys when the user's finger is not 
properly aligned with the intended key or any single specific key. Thus, following each 

1 0 received keystroke, the keystroke is not processed until after the system waits for a very brief 
timeout threshold, or until a keystroke is received on a non-adjacent key. If the next keystroke 
occurs on an adjacent key, or if multiple keystrokes occur on adjacent keys, before the 
expiration of the timeout threshold, the detected keys are regarded as a single keystroke event. 
In such cases, a "virtual" point of contact is calculated at the center of the set of 

15 "simultaneously" activated keys. The distances from this calculated "virtual" contact point and 
the known character locations (key locations in this case of a mechanical keyboard) are 
calculated by interpolating to a logical coordinate frame with a finer resolution than that of the 
physical sensors (keys). 

In another embodiment of the invention, keystrokes on the auto-correcting keyboard are 

20 not matched to characters in isolation, but rather entire sequences of keystrokes corresponding 
to completed words are matched against a lexicon of candidate words that includes information 
regarding the relative frequency with which each word appears in a representative corpus of 
usage. In this way, the system is often able to correctly compensate for occasional keystroke 
errors of a larger than average magnitude, or even multiple errors of a relatively larger 

25 magnitude when the intended word is of high relative frequency. This word-level analysis of 
keystroke input sequences are a key factor in enabling the inventive system to flexibly 
accommodate user keystroke errors. 

The word-level analysis of keystroke sequences enables the system to generate 
punctuation characters such as a hyphen or apostrophe that are commonly embedded in words 

30 such as hyphenated compounds and contractions in English. Such embedded punctuation 

characters can be associated with one or more keys or character locations included in the auto- 
correcting keyboard among those associated with alphabetic characters. When more than one 
punctuation character is associated with a single key, the specific punctuation character 
intended can be disambiguated based on the information included in the lexicon. Thus, for 

35 example, if a word in the lexicon includes an apostrophe in a position corresponding to a key 
contact in the region of an ambiguous punctuation key, the matching algorithm will 
automatically identify the associated word and disambiguate the keystroke as an apostrophe. 
Simultaneously, the system can separately analyze the keystroke sequences preceding and 
following the key contact in the region of the punctuation key to determine the most likely 
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matching words in the lexicon and calculate the likelihood that a hyphenated compound was 
intended. In the case of a keyboard implemented on a touch-sensitive display screen, other 
symbols, numbers or other characters not commonly used are relegated to a separate symbol 
selection scheme, preferably through presentation in a series of temporarily displayed tables. 
5 Such Symbol Tables are preferably accessed through a function key or touch entry element 
assigned adjacent to the auto-correcting keyboard region. In the case of a mechanical 
keyboard, these other symbols, numbers and uncommon characters are often accommodated 
through additional keys not included in the auto-correcting keyboard. 

In accordance with another aspect of the invention, candidate words that match the 

10 input sequence are presented to the user in a word selection list on the display as each input is 
received. In accordance with another aspect of the invention, the word interpretations are 
presented in the order determined by the matching metric calculated for each candidate word, 
such that the words deemed to be most likely according to the matching metric appear first in 
the list. Selecting one of the proposed interpretations of the input sequence terminates an input 

1 5 sequence, so that the next keystroke inside the auto-correcting keyboard region starts a new 
input sequence. In accordance with yet another aspect of the invention, only a single word 
interpretation appears on the display, preferably at the insertion point for the text being 
generated. The word interpretation displayed is that deemed to be most likely according to the 
matching metric. By repeatedly activating a specially designated selection input, the user may 

20 replace the displayed word with alternate interpretations presented in the order determined by 
the matching metric. An input sequence is also terminated following one or more activations 
of the designated selection input (effectively selecting exactly one of the proposed 
interpretations of the sequence for actual output by the system), so that the next keystroke 
inside the auto-correcting keyboard region starts a new input sequence. 

25 In accordance with another aspect of the invention, for each input sequence of contact 

points, a word is constructed by identifying the character nearest each contact point and 
composing a word consisting of the sequence of identified characters. This "Exact Type" word 
is then presented as a word choice in the word selection list. This word may then be selected in 
the usual manner by, for example, touching it in the word selection list. Exact Type entries can 

30 be edited by, for example, pressing a backspace key to delete one character at a time from the 
end of the word. Once the user selects the Exact Type word, it is automatically "accepted" for 
output and is added to the information being composed. When so selected, the Exact Type 
string may be added as a candidate for inclusion into the lexicon of words so that in the future 
it can be typed using the auto-correcting keyboard without needing to precisely contact each 

35 letter of the word as is necessary in first entering an Exact Type entry. 

Figure IB shows a preferred embodiment of a word choice list 150 that is displayed 
after the user has entered a sequence of keystrokes within the auto-correcting keyboard region 
106. The word choice list includes a Cancel key 152, wherein contacting the Cancel key 
causes the system to discard the current input sequence, clearing the word choice list and 
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causing the system to restore the display of any text or graphics obscured by the appearance of 
the word choice list. The "Exact Type" word 154 shows the sequence of letters closest to the 
actual contact points of the input sequence, whether or not these correspond to any word in any 
vocabulary module. In the example shown in Figure IB, the Exact Type word "rwzt" does not 
correspond to an English word. In a preferred embodiment, selecting the Exact Type word for 
output results in the automatic addition of that word to the appropriate vocabulary module if it 
is not already included. The Default word 160 ("text" in the example of Figure IB) is the word 
from the vocabulary modules determined to have the lowest value of the matching metric (that 
is, the more likely the word corresponds to the user's intention), and in a preferred 
embodiment, is shown at the bottom of the word choice list, nearest to the auto-correcting 
keyboard region 106. Similarly, three alternate word choices 157 are shown in the list in an 
order determined by their corresponding matching metric values. 

The Symbols Mode key 116, the Alternate Letter Mode key 120, and the Numeric 
Mode key 122 each cause a corresponding keyboard of punctuation and symbols, alphabetic 
letters, and numeric digits, respectively, to appear on the display screen. The user can then 
select the desired character or characters from the displayed keyboard. If a word choice list 
was displayed prior to displaying such an alternate keyboard, the selection of any character 
from the displayed alternate keyboard causes the Default word of the previously displayed 
word choice list to be output to the output text region 104 prior to outputting the selected 
character from the alternate keyboard. Similarly, if a word choice list was displayed prior to 
contacting the Space key 1 10 or the Return key 118, the Default word 160 is flushed to the 
output text region 104 prior to generating a single space or carriage return character, 
respectively. 

In the preferred embodiment, the Shift key 108 functions as a latching Shift key, such 
that contacting it causes the letter associated with the next contact in the auto-correcting 
keyboard 106 to be generated as an upper-case letter. In another preferred embodiment, two 
successive contacts on the Shift key 108 puts the system in "Caps-Lock," and a subsequent 
activation cancels "Caps-Lock" mode. Backspace key 112 deletes the last input contact from 
the current sequence of contacts if one exists, and otherwise deletes the character to the left of 
the cursor at the insertion point 107 in the output text region 104. When no current input 
sequence exists, a contact on the Edit Word key 1 14 causes the system to establish a current 
input sequence consisting of the coordinate locations associated with the letters of the word 
that contains the insertion point cursor 1 07 or is immediately to the left of this cursor in the 
output text region 104. The result is that this word is "pulled in" to the system creating a word 
choice list in which the word appears both as the Default word 160 and the Exact Type word 
154. 

A block diagram of the reduced keyboard disambiguating system hardware is provided 
in Figure 2. The touch screen 202 and the display 203 are coupled to a processor 201 through 
appropriate interfacing circuitry. Optionally, a speaker 204 is also coupled to the processor. 
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The processor 201 receives input from the touch screen, and manages all output to the display 
and speaker. Processor 201 is coupled to a memory 210. The memory includes a combination 
of temporary storage media, such as random access memory (RAM), and permanent storage 
media, such as read-only memory (ROM), floppy disks, hard disks, or CD-ROMs. Memory 
5 210 contains all software routines to govern system operation. Preferably, the memory 

contains an operating system 211, auto-correction software 212, and associated vocabulary 
modules 213 that are discussed in additional detail below. Optionally, the memory may 
contain one or more application programs 214, 215, 216. Examples of application programs 
include word processors, software dictionaries, and foreign language translators. Speech 

10 synthesis software may also be provided as an application program, allowing the reduced auto- 
correcting keyboard system to function as a communication aid. 

In accordance with another aspect of the invention, each input sequence is processed 
with reference to one or more vocabulary modules, each of which contains one or more words 
together with information about each word including the number of characters in the word and 

15 the relative frequency of occurrence of the word with respect to other words of the same 

length. Alternatively, information regarding the vocabulary module or modules of which a 
given word is a member is stored with each word. Each input sequence is processed by 
summing the distances calculated from each contact point in the entry sequence to the location 
assigned to the letter in the corresponding position of each candidate word, wherein the 

20 distances are calculated according to one of the preferred methods. This total distance is 
combined with the frequency information regarding each candidate word to calculate a 
matching metric by which the various candidate words are rank ordered for presentation to the 
user. In one preferred embodiment, the matching metric is calculated as follows. The square 
of the distance from each contact point in the entry sequence to the location assigned to the 

25 letter in the corresponding position of each candidate word is calculated, and the sum of the 
squared distances is calculated for each candidate word. This sum is then multiplied by a 
frequency adjustment factor, which in one preferred embodiment is calculated as the base 2 
logarithm of the ordinal position of the word in the candidate list, where words of higher 
relative frequency are moved "higher" in the list to positions corresponding to a lower ordinal 

30 position (i.e., the most frequent word is at position "1"). Thus, the lower the numeric value of 
the calculated matching metric, the more likely a given word is deemed to correspond to the 
user's intention in generating a sequence of contact points. 

In another aspect of the invention, prior to multiplying the sum of the distances (from 
each contact point in the entry sequence to each corresponding letter in a candidate word) by 

35 the frequency adjustment factor, a fixed increment is added to the sum so that it is at least 
greater than or equal to this increment value. This is done to avoid calculating a matching 
metric value of zero (i.e., the most likely match) when the sequence of contact points happens 
to correspond exactly to the spelling of a given word, even when that word occurs with very 
low frequency (i.e., has a high numeric ordinal position). This allows much more frequently 
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occurring words to produce a better matching metric even when an inaccurate sequence of 
contact points is entered. In one implementation of this preferred embodiment, it was found 
that a fixed increment value of approximately twice the average distance between adjacent 
characters in the keyboard was effective in reducing spurious matches with infrequent words. 
5 In accordance with another aspect of the invention, words in each vocabulary module 

are stored such that words are grouped into clusters or files consisting of words of the same 
length. Each input sequence is first processed by searching for the group of words of the same 
length as the number of inputs in the input sequence, and identifying those candidate words 
with the best matching metric scores. In accordance with another aspect of the invention, if 

10 fewer than a threshold number of candidate words are identified which have the same length as 
the input sequence, and which have a matching metric score better than a threshold value, then 
the system proceeds to compare the input sequence of N inputs to the first N letters of each 
word in the group of words of length N+l . This process continues, searching groups of 
progressively longer words and comparing the input sequence of N inputs to the first N letters 

15 of each word in each group until the threshold number of candidate words are identified. 

Viable candidate words of a length longer than the input sequence may thus be offered to the 
user as possible interpretations of the input sequence, providing a form of word completion. In 
another aspect of the invention, prior to multiplying the sum of the distances (from each 
contact point in the entry sequence to each corresponding initial letter in a candidate word 

20 whose length is greater than the length of the current input sequence) by the frequency 

adjustment factor, a second fixed increment is added to the sum so that it is greater than the 
distance sum that would be calculated for a word whose length corresponds exactly to the 
length of the current input sequence. This is done to assign a relatively higher matching 
probability to words whose length does correspond exactly. In another preferred embodiment, 

25 this second increment factor is a function of the difference in length between the candidate 
word and the current input sequence. 

In accordance with another aspect of the invention, in order to increase the efficiency 
with which the vocabulary modules can be processed, each character mapped onto the touch- 
sensitive pad in the auto-correcting keyboard region is assigned a boundary of exclusion. Each 

30 such boundary identifies the region beyond which the distance from the contact point to the 
character will not be calculated and the character will be removed from consideration for that 
contact point in the input sequence, reducing the computation required by the distance 
calculation process. The boundaries of exclusion for several characters may share some or all 
common boundary segments. Examples of shared boundaries include the extreme edge of the 

35 auto-correcting keyboard region, or gross boundaries drawn through the character space to 
subdivide the auto-correcting keyboard into 2, 3 or more major clustering regions. 
Conceptually, it is identical to consider the boundary of exclusion for a given contact point, 
where any character outside that boundary is excluded from consideration as a match for that 
input point. For example, Figure 3 shows an auto-correcting keyboard region 300 that consists 
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of a horizontal rectangle which is divided vertically into three clustering regions 301, 302, 303 
of approximately equal size, where these regions are defined such that each character falls into 
only one of the three clustering regions. The clustering regions are defined such that for each 
contact point in the auto-correcting keyboard region, at least one and frequently two of the 
5 three regions lie completely outside the boundary of exclusion for that contact point. For 

example, a contact point 31 1 at the left side of region 301 is far enough away from region 302 
that all characters in region 302 (and region 303) could be defined to lie outside the boundary 
of exclusion for contact point 311. In contrast, the boundary of exclusion for contact point 312 
at the right side of region 301 would extend into region 302 such that one or more characters in 

10 region 302 would be considered to lie inside the boundary, so that the only region completely 
outside the boundary of exclusion for contact point 312 would be region 303. The boundary of 
exclusion for contact point 313 in the center of region 302 could be considered to be far 
enough away from both region 301 and region 303 that all characters in both these regions 
could be defined to lie outside the boundary. 

15 Such clustering regions are then used to increase the efficiency with which the system 

can identify the most likely matching words in one or more vocabulary modules for a given 
input sequence of contact points. Continuing the example described above and depicted in 
Figure 3, the words of a given length in each vocabulary module can be divided up into nine 
different sub-groups based on the clustering regions in which each of the first two letters of 

20 each word is found, since there are nine possible ordered pairs of such regions. Note that 
processing words of only one letter need not be optimized since very little calculation is 
required and there are only a very small number of one-letter words, even when every letter is 
treated as if it is a one-letter "word." For each of the first two contact points, letters in one or 
two of the clustering regions can be eliminated from consideration, so that all of the words in 

25 the sub-groups associated with letters in the eliminated regions can be skipped over without 

needing to perform any distance calculations. Thus, assuming a more or less equal distribution 
of total character frequency among the three regions for the first two character positions of 
words in the vocabulary modules, upon the receipt of the second contact point, the system need 
only calculate distances for and compare at most 4/9 of the candidate words (when only one 

30 clustering region is eliminated from consideration for each contact point) to as few as 1/9 of 
the candidate words (when two clustering regions are eliminated for each contact point). As 
would be obvious to one of ordinary skill in the art, this method can be used with a greater or 
lesser number of clustering regions, and for different numbers of initial contact points, with 
corresponding results. For example, four clustering regions could be used to divide candidate 

35 words into sixteen sub-groups based on the first two contact points. 

In another embodiment of the invention, a subset of characters or functions will be 
associated with uniquely defined regions or keys outside the auto-correcting keyboard, where 
entries within these regions are interpreted as explicit entries of a specific character or 
function, for example, a Space key that unambiguously generates a single space when selected. 

24 



WO 00/74240 



PCT/US00/14684 



For a defined set of such keys, selecting such a key immediately following an input sequence, 
and prior to performing an explicit selection of any of the interpretations offered by the system 
for the input sequence, results in the automatic acceptance of the interpretation of the input 
sequence deemed to be most likely according to the matching metric calculated for each 
5 candidate word. The input sequence is terminated, so that the next keystroke inside the auto- 
correcting keyboard region starts a new input sequence. Once the desired word interpretation 
of an input sequence has been determined and the sequence is terminated, the system 
automatically outputs the word so that it is added to the information being constructed. In the 
case of certain functions, for example the backspace function, an entry within the associated 

1 0 region is interpreted as an explicit entry of the backspace function, which is immediately 

executed. The result in this case however, does not terminate the input sequence, but simply 
deletes the last (most recent) input from the sequence. In general, keys outside the auto- 
correcting keyboard are immediately interpreted and acted upon according to the unique 
character or function associated with the key. Depending on the associated character or 

15 function, in some cases the current input sequence is terminated as a result. 

In accordance with another aspect of the invention, the system distinguishes between 
two different types of contact events that occur in the area of the touch screen or touch- 
sensitive display that is used to display the auto-correcting keyboard region and other uniquely 
defined regions or keys outside the auto-correcting keyboard. One type of contact consists of a 

20 "tap" event, wherein the touch screen is contacted and then the contact is terminated without 
moving beyond a limited distance from the initial point of contact. This type of event is 
processed as a keystroke intended for the reduced auto-correcting keyboard system as 
described in this disclosure. The second type of contact consists of a "stroke" event, wherein 
the touch screen is contacted then the point of contact is moved in one or more directions 

25 beyond the limited distance threshold used to define a tap event. This second type of contact 
can then be processed by using a stroke recognition system using techniques that are well 
known in the art. This allows a number of additional functions or special characters to be 
made easily accessible to the user without having to trigger pull-down menus or define 
additional keys that would either require additional screen space or reduce the size of the keys 

30 provided. The interpretation of such strokes and the resulting character or function associated 
with a recognized stroke is then processed by the system in the same fashion as an activation of 
one of the uniquely defined regions or keys outside the auto-correcting keyboard. In this way, 
only a limited area of the available touch screen or touch-sensitive display needs to be used to 
accommodate both keyboard-based and stroke recognition input approaches. 

35 In accordance with another aspect of the invention, the system performs additional 

processing of "tap" events to dynamically adjust to a particular user's style of contacting the 
touch screen or touch-sensitive display. For a user who contacts the display with a stylus 
"gesture" that closely approximates a point (z'.e., the stylus contacts the display and is lifted 
before being moved any appreciable distance), there is no significant ambiguity as to where the 
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user intended to contact the keyboard. However, when the stylus moves to a greater or lesser 
extent before being lifted from the display, there is ambiguity as to which point contacted 
during the duration of the stylus "gesture" best represents the point that the user intended to 
contact - the initial point of contact, the final point where the stylus was lifted, or some other 
5 point along the path of contact of the stylus. In a preferred embodiment, the system records the 
path traced out by each contact as a set of N contact points that include the endpoints of the 
path of contact and zero or more points equally spaced points along the path. For example, in 
an embodiment where N is set to 3, the set would include the endpoints and the midpoint of the 
path. Initially, one point of each set of recorded points is designated as the coordinate to be 

10 used to represent the point of contact in calculating distances to letters in the auto-correcting 
keyboard. For example, in a preferred embodiment, the initial point of contact recorded in 
each set is designated as the coordinate to be used to represent the point of contact in all 
distance calculations. Each time a word is selected for output from the word choice list, the 
distance calculation is repeated from each letter in the chosen word to each of the other sets of 

1 5 recorded points. For example, when N is set to 3 and the calculation is performed for the 

starting point of contact, the calculation is repeated for the set of midpoints and for the set of 
endpoints. Whichever set of points results in the minimum calculated distance is then 
designated as the set of points whose coordinates are to be used to represent the sequence 
points of contact in all subsequent distance calculations in creating the word choice list for 

20 each sequence of contacts. In another preferred embodiment, a running average is computed of 
the distance calculations performed for each point set, and the point set for which this running 
average is lowest is used for distance calculations in creating the word choice list. In another 
preferred embodiment, the running average is computed of the horizontal and vertical signed 
offset of the designated points from the intended target letters, and the coordinates used in 

25 distance calculations for creating the word choice list are adjusted by this average signed 
offset, or by a fixed fraction of this offset. Thus, if a user consistently contacts the touch 
screen somewhat to the left and below the intended letters, the system can automatically adjust 
and more accurately predict the intended word on average. 

In accordance with another aspect of the invention, each character mapped onto the 

30 touch-sensitive pad in the auto-correcting keyboard region is assigned a region of territory. 

Each such region identifies an area wherein the distance from the user entry to the character is 
assigned the value of zero, simplifying the distance calculation process. The size of such 
regions can vary for different characters and functions. For example, a larger region may be 
assigned to a character that occurs with a relatively higher frequency within a representative 

35 corpus of usage. In one embodiment, this region assigned to a character simply corresponds to 
a defined key that is clearly circumscribed and demarcated on the touch screen, and to which 
the character is assigned. 

The operation of the reduced auto-correcting keyboard system is governed by the auto- 
correction software 212 which is based in part on the distance between the contact point and 
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the various candidate characters. In one embodiment of the invention, each character in the 
auto-correcting keyboard is assigned a Cartesian coordinate to simplify the calculation of the 
distance to keystrokes. The distance between the contacted point and the various candidate 
character locations in the auto-correcting keyboard, therefore, is calculated through simple 
5 Cartesian coordinate distance analysis. In another embodiment of the invention, the square of 
the simple Cartesian coordinate distance is used to both simplify calculation (because no 
square root need be calculated) and to apply a non-linear weighting to more distant contact 
points. In other embodiments, the distance calculation also utilizes other nonlinear functions 
such as natural logarithms or discrete steps, either exclusively or in an appropriately weighted 

1 0 combination with Cartesian coordinate distance analysis. 

Also, in another preferred embodiment the distances in the x and y directions are 
weighted differently. Such modifications to the distance calculation can serve to simplify word 
selection, reduce processing requirements, or to accommodate systematic entry anomalies 
depending on the particular needs of a given system and its implementation. For example, in 

1 5 the case of a touch screen display panel on which a QWERTY keyboard arrangement is 

displayed with three rows of character keys, it is generally less likely that a significant error 
will be made in contacting the keyboard in the wrong row. In this case, vertical distance along 
the y-axis may be weighted more heavily than horizontal distance along the x-axis. 

Additionally, the spacing between characters on the auto-correcting keyboard may be 

20 non-uniform, depending on how frequently any given character is used in a representative 
corpus of the language, or on its location relative to the center or the edge of the keyboard. 
Alternatively, when sufficient computational resources are available, one or more characters 
may be assigned a plurality of corresponding coordinate locations to be used as reference 
points in calculating the distance of the key from the coordinate location of a point at which the 

25 keyboard was contacted, wherein that distance is calculated as the distance from the contacted 
point to the closest such assigned reference coordinate. This will tend to decrease the 
calculated distances to points at which the display is contacted in a non-linear fashion, and 
consequently increase the size of the region surrounding the character in which there is a high 
likelihood that a sequence of contacts that includes a contact point in the region will match a 

30 word with the character in the corresponding position. 

Also, the coordinates of the characters may be assigned to locations not shared by 
keyboard keys or directly detectable by sensors. This allows the system to be implemented 
using less costly touch screens or keyboards having a lower resolution in detecting points of 
contact. It also allows the keyboard to be reconfigured according to the user's wishes but still 

35 utilizing the same physical keyboard touch screen or sensor array. For example, the common 
three rows of characters in the QWERTY layout may be served with 1, 2, 3 or more rows of 
sensors to reduce the keyboard's mechanical complexity or to allow dynamic reassignment of 
novel keyboard arrays. An example would be changing from 3 rows of 9 characters to 4 rows 
of 7 characters. The assigned coordinate location for a given character key may thus lie 
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between the nearest two or more points at which physical contacts with the keyboard may be 
detected and resolved. In this case, the distance from a detectable contact point and the 
assigned character location is calculated by interpolating to a logical coordinate frame with a 
finer resolution than that of the physical sensors. 
5 In one preferred embodiment of the system, additional processing optimization is 

achieved by setting a limit on how many letters in a given candidate word can be further away 
from the corresponding input contact point than a preset threshold distance 
{maximum _key_distance). If this preset threshold maximum _key distance is set to a value 
other than MAX_DISTANCE VALUE (the maximum distance value that can be calculated by 

10 the distance measurement algorithm used in the system), then a second threshold 

over max ^distance Jimit is set to the maximum number of letters in a given candidate word 
that can be further than maximum Jkeyjdistance from the corresponding input contact point. 

Figures 4A through 4K show a flow chart of a preferred embodiment of a main routine 
of the auto-correction software 212 that generates and manages a word choice list to aid the 

15 user in efficiently utilizing inaccurate keystroke sequences. Figure 4A, shows the main 

processing routine, which when the system is first started, at block 4105 initializes both the 
word choice list and the current input sequence to an empty state. Then, at block 4110, the 
system waits to receive a keystroke from the touch screen 202. Upon receipt of a keystroke, at 
block 4115 the system determines whether the coordinates x/y of the received keystroke 

20 contact point lie within the boundaries of the auto-correcting keyboard region 106. If not, then 
the process indicated in block 4120 is carried out as shown in the flowchart of Figure 41, 
wherein the system processes the specific character or function associated with the defined key 
region containing contact point x/y. If at block 41 15 the received keystroke contact point x/y 
lies within the boundaries of the auto-correcting keyboard region 106, then at block 4130 the 

25 length k of the sequence is incremented by one, and the point x/y is appended to the end of the 
current input sequence as the k th input. Then at block 4140, the system sets the key distance 
table entries KDj k to the squares of the Cartesian distances from the k lh input point x/y to the 
nearest point on the perimeter of each key Kj in auto-correcting keyboard, setting KD ik to 0 
when x/y is in the interior of key region Kj. At block 41 50, for each possible character Q that 

30 is a valid character appearing in one or more words of the vocabulary modules 213, a translate 
table character jnap mapping each character value Q to its corresponding key K\ is used to set 
each element CD jk of the k lh row of the character _distance table to the squared distance KD ik 
from k lh input x/y to the key Kj corresponding to each possible character Q. This allows the 
distances used in calculating the matching metric to be calculated only once (when KD !k is set 

35 in the key distance table), and the translate table character jnap is likewise used only once 
when the character ^distance table is filled in from the key distance table. This allows for 
greater efficiency in the large number of calculations that would otherwise be repeated in 
processing the words of the vocabulary modules. 
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At decision block 4160, the variable maximum key _d is tance is checked to determine 
whether it is set to a value other than MAX_DISTANCE_VALUE, and if so, at block 4170, an 
auxiliary index array is sorted so that, in searching the words of the vocabulary modules, the 
x/y inputs can be processed in ascending order of the likelihood of matching a character 
5 mapped to a key close to the x/y input. Sorting the inputs x/y to be processed in this order 

tends to minimize the total amount of computation required to process the vocabulary modules 
for the input sequence, since this increases the likelihood across all words that one or more x/y 
inputs will be greater than maximum key ^distance from the corresponding key before all 
inputs have been processed, so that the remaining inputs need not be processed since the word 

10 will be disqualified as a candidate as soon as the number of such inputs exceeds 

over max distance Jimit. Figure 4K shows a preferred embodiment of how the likelihood of 
matching a character mapped to a key close to a given x/y input is calculated. A table 
character frequencies is included with the vocabulary modules wherein element 
character frequencies y contains the sum of the relative frequencies of all characters found as 

15 the j th character of any word in the vocabulary, and which are mapped in character _map to key 
Ki. At block 4 1 1 1 0, match probability k for the current input x/y is initialized to zero. In loop 
41 120, at decision block 41 130, if key Kj is no greater than maximum _key_distance from the 
input x/y, then at block 41 140 then match probability k is incremented by the value of 
character frequencies^ divided by the distance (KD^ + 1), where the distance KD,k is 

20 incremented by 1 to avoid division by zero and to scale each character frequencies^ 

appropriately. When all keys have been processed in loop 41 120, then at block 41 150, the 
index values l.k in array keyjindex are sorted in ascending order according to the values of 
match probability k that have been calculated for each input. 

Returning to Figure 4 A, at block 41 80, a new word choice list is computed based on the 

25 current input sequence of length k. Figure 4B shows a preferred embodiment for computing a 
new word choice list. At block 4210, any previously computed word choice list is cleared. At 
block 4220, if current input sequence includes two or more inputs, the software determines 
which of the three clustering regions can be excluded from consideration based on the 
coordinate locations of the first two inputs of the sequence. At block 4230, based on the 

30 excluded clustering regions, of the nine possible sub-groups of each word list in the vocabulary 
modules, the sub-groups of words which actually need to be processed are identified. Then, in 
block 4240, only these identified sub-groups are processed in order to identify candidate words 
to show in the word choice list. 

Figure 4C shows how the words in the identified sub-groups are processed to build a 

35 word choice list of the words that are most likely to match the user's intention in entering an 
input sequence of keystrokes consisting of contact points inside the auto-correcting keyboard 
region. Block 4305 defines the limits of the loop from block 43 10 through the test at block 
4380 for loop termination. Block 4380 tests whether there are any more words of length /r, the 
current length of the input sequence, that belong to one of the identified sub-groups, that have 
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not yet been processed. If not, then at block 4385 the system tests whether the word list has 
been filled, and whether the value of the matching metric calculated for each word in the list is 
less than a predetermined threshold. If so, then the word list is deemed to have been filled with 
potential matches of an appropriate quality, and at block 4395 the processor returns to block 
5 4260 in Figure 4B. If not, then at block 4390, the system tests whether there are any more 
words of any length greater than £, that belong to one of the identified sub-groups, that have 
not yet been processed. If not, then at block 4395 the processor returns to block 4260. 

Once the next word to be processed has been obtained from a vocabulary module at 
block 4310, at block 4315 word distance is set to zero so that the weighted distance from the 

10 input sequence to this word can be summed into this variable. Then at block 4320, 

over jnax jdistance count is set to zero so that the number of letters in the word that are further 
from the corresponding contact point in the input sequence than a preset maximum threshold 
maximum _key jdistance to this word can be counted with this variable. Then at block 4330, 
the word is processed as shown in Figure 4D. Block 4410 initializes the loop counter i to 1 and 

15 defines the limits of the loop from block 4420 through the test at block 4490 for loop 

termination when each of the k inputs in the current sequence have been processed. At block 
4420, the variable next_key is set to the next index array value key_index x , which was sorted 
according to Figure 4J so that inputs are processed starting with those for which the calculated 
key jdistance is most likely to exceed the threshold maximum _key distance. The loop counter i 

20 is then incremented. At block 4430, key jdistance is set to the corresponding value already 
calculated and stored as described above in the array CD, which contains the distance from 
each contact point in the input sequence to each possible character that occurs in any word of 
the vocabulary modules. At decision block 4440, if key distance exceeds the threshold 
maximum _key jdistance, then at block 4450 over max jdistance count is incremented and at 

25 decision block 4455 over jnax distance count is tested to determine if the maximum number 
over jnax jdistance _limit of such inputs has been exceeded. If so, loop 4410 is terminated 
early and the incorrect word distance, which is then returned at block 4490, is of no 
consequence since this word is disqualified from further consideration. If at decision block 
4440, key jdistance does not exceed the threshold maximum key jdistance, or if at decision 

30 block 4455, over jnax distance count does not exceed the threshold over max distance Jimit^ 
then at block 4460 key distance is summed into the total word jdistance being calculated for 
the current word. This process is repeated for each contact point in the input sequence until at 
decision block 4470 it is determined that all inputs have been processed. When this process 
returns from block 4490, either word distance has been correctly calculated for the current 

35 word, or over jnax jdistance jcount exceeds the threshold over max jdistance _limit, and the 
current word is disqualified from further consideration. 

Returning to Figure 4C at decision block 4335, over max distance count is tested to 
determine whether the current word is disqualified, and if so, execution proceeds to block 4380 
to test whether the word processing loop 4305 should terminate. If not, at decision block 4340, 
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if the length of the current word is greater than the number of inputs, then at block 4345 the 
calculated word distance is incremented by a fixed amount wrong Jengthjidditive _factor. In 
another preferred embodiment, wrong Jengthjidditive _Jactor is calculated as a function of the 
difference between the number of letters in the word and the number of contact points in the 
5 input sequence. In either case, processing returns to block 4350 where word distance is 

incremented by a fixed amount all jvord _additive ^factor, to prevent any word from having a 
wordjdistance value of zero, so that the match jnetric calculated for the word reflects its 
relative priority in the vocabulary module's list. At block 4355, a multiplicative factor 
idxjnultiplier is calculated to use as a weighting factor in combination with the calculated 

10 wordjdistance to determine the value of match jnetric for the word. In the preferred 

embodiment shown in Figure 4C, idxjnultiplier is calculated as the bit position of the highest 
bit set to 1 in a binary representation of the numeric index of the word's ordinal position in the 
list of which it is a member. This value ranges from 0 to 3 1 , where 0 corresponds to the 
highest bit position, and 31 the lowest. In block 4360, this value is incremented by 1 so the 

1 5 multiplicative factor used is greater than or equal to 1 so that any non-zero wordjdistance will 
result in a non-zero match jnetric value. 

If at decision block 4370 the match jnetric is less than the worst (i.e., highest) 
match jnetric score within the word choice list then the current word is inserted into the word 
choice list. If the match jnetric is greater than or equal to the worst match jnetric score within 

20 a full word choice list, than the current word is ignored. Decision block 4380 returns to block 
43 1 0 if there are any words left that have not been processed that are the same length k as the 
current input sequence. When decision block 4380 finds no more words to of length k to 
process, decision block 4385 tests whether the word choice list contains a full complement of 
matching words (in one preferred embodiment, a full complement consists of four words), each 

25 of which has a match jnetric value below a predetermined threshold. If at decision block 4385 
it is found that that the word list does not yet contain a full complement of matching words, 
then at block 4390 the system determines whether there are any words left of length greater 
than the current input sequence length k y and if so, execution continues from block 4310. 
Word testing continues until decision block 4385 finds that the word choice list is filled with 

30 matching words, or until decision block 4390 finds there are no additional words to test. 

Returning to Figure 4B, the word choice list is processed at block 4260. Figure 4E 
shows a preferred embodiment for processing the word choice list. At block 4510, the words 
selected for the word choice list are sorted in ascending order by word length, so that words 
whose lengths are equal to the number of keys for the entry string will be presented as the most 

35 likely alternatives within the word choice list. At block 4520, each group of words of the same 
length is sorted in ascending order according the values of match jnetric that have been 
calculated for each word, so that words with lower values of match jnetric are presented as the 
most likely alternatives. 
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Words may be included in the vocabulary modules that are marked as known 
misspellings, for example, words in English in which the correct order of the letters "i" and "e" 
is commonly reversed. In one preferred embodiment, when such misspelled words are 
identified as candidate words from the vocabulary modules and are included in the word choice 
5 list, they are flagged such that they can be replaced in the word list at block 4540 with the 
corresponding correctly spelled words. Similarly, words may also be included in the 
vocabulary modules that are flagged as macros or abbreviations that are associated with other 
strings of text to be output and/or designated functions to be executed upon selection of the 
associated word. Such macros are also replaced in the word choice with the corresponding 

10 associated text strings at block 4540. Block 4560 applies the proper shift states to all 

characters of words within the word choice list, transforming the relevant characters into their 
proper form of upper case or lower case, according to the shift state that was in effect at the 
time the keystroke corresponding to each letter was performed. At block 4580 duplicate words 
in the word choice list are eliminated by removing the lowest priority duplicates. 

15 In accordance with another aspect of the invention, a software application 

implementing the input method of the present invention is installed in an existing device. 
During the installation of this application in the device, existing information files are scanned 
for words to be added to the lexicon. Methods for scanning such information files are well 
known in the art. As new words are found during scanning, they are added to the lexicon 

20 structure as low frequency words, and as such are placed at the end of the word lists with 

which the words are associated. Depending on the number of times that a given new word is 
detected during the scanning, it is assigned a relatively higher and higher priority, by 
promoting it within its associated list, increasing the likelihood of the word appearing in the 
word selection list during information entry. 

25 In accordance with another aspect of the invention, the lexicon of words has an 

appendix of offensive words, paired with similar words of an acceptable nature, such that 
entering the offensive word, even through Exact Typing of the locations of the letters 
comprising the offensive word, yields only the associated acceptable word in the Exact Type 
field, and if appropriate as a suggestion in the Word Selection List. This feature can filter out 

30 the appearance of offensive words which might appear unintentionally in the selection list once 
the user learns that it is possible to type more quickly when less attention is given to contacting 
the keyboard at the precise location of the intended letters. Thus, using techniques that are 
well known in the art, prior to displaying the Exact Type word string, the software routine 
responsible for displaying the word choice list simply compares the current Exact Type string 

35 with the appendix of offensive words, and if a match is found, replaces the display string with 
the associated acceptable word. Otherwise, even when an offensive word is treated as a very 
low frequency word, it would still appear as the Exact Type word when each of the letters of 
the word is directly contacted. Although this is analogous to accidentally typing an offensive 
word on a standard keyboard, the system of the present invention is designed to allow and even 
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encourage the user to type with less accuracy. This feature can be enabled or disabled by the 
user, for example, through a system menu selection. 

Those skilled in the art will also recognize that additional vocabulary modules can be 
enabled within the computer, for example vocabulary modules containing legal terms, medical 
5 terms, and foreign language terms. Via a system menu, the user can configure the system so 
that the additional vocabulary words can be caused to appear first or last in the list of possible 
words, with special coloration or highlighting. Consequently, within the scope of the appended 
claims, it will be appreciated that the invention can be practiced otherwise than as specifically 
described herein. 

10 Returning through Figure 4B to Figure 4A, the word selection list is presented to the 

user and the main routine awaits the next keystroke from the touch screen 202 at block 4110. 
If, upon receipt of a keystroke, at block 4115 the system determines that the received keystroke 
contact point lies outside of the boundaries of the auto-correcting keyboard region 106, then 
the process of block 4120 is carried out as shown in Figure 41. Block 4910 identifies the 

1 5 character or function associated with the defined region. If at decision block 4920 the word 

choice list is empty, then block 4925 generates the character(s) associated with the defined key 
region, or executes the function associated with the defined key region, and at block 4970 the 
system returns to Figure 4A. If one or more words are currently displayed in the word choice 
list, decision block 4930 determines if the keystroke x/y coordinate falls within the word- 

20 choice list region 150. If so, at block 4935 the system processes the word selection from the 
word-choice list. 

In accordance with another aspect of the invention, the user presses a Space key to 
delimit an entered keystroke sequence. After receiving the Space key, the disambiguating 
system selects the most frequently used word and adds the word to the information being 

25 constructed. The Space key is used to delimit an entered sequence. 

In accordance with another aspect of the invention, the word selection list is presented 
as a vertical list of candidate words, with one word presented in each row, and wherein each 
row is further subdivided into regions or columns. The regions or columns define functions 
related to the acceptance of the candidate string displayed in the selected row, such as 

30 including or not including a trailing blank space, appending a punctuation mark or applying a 
diacritic accent. The function may be applied when the user touches the row of the intended 
string in the word selection list at a point contained in the region or column associated with the 
desired function. When the user selects the desired candidate word by contacting the row 
within certain regions or columns, the word is automatically "accepted" for output and is added 

35 to the information being composed. For example, contacting a row within the region 

associated with appending a trailing space immediately outputs the associated word with a 
trailing space. In accordance with another aspect of the invention, one such region or column 
is defined such that contacting a row within the region invokes a function which replaces the 
current input sequence of actual contact points with a sequence of contact points corresponding 
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to the coordinate locations of the letters comprising the word in the selected row, but without 
terminating the current input sequence. As a result, the selected word appears in the selection 
list as the Exact Type interpretation of the input sequence. In most cases, the selected word 
also appears as the most likely word interpretation of the input sequence, although if the letters 
5 of the word are each near those of a much more frequent word, the more frequent word will 
still appear as the most likely word interpretation. This ability to re-define the input sequence 
as the coordinate locations of the letters of the intended word without terminating the input 
sequence enables the user to then continue typing, for example, a desired inflection or suffix to 
append to the word. When the intended word is relatively infrequent, and especially when it is 

10 seen only infrequently with the desired inflection or suffix, this feature makes it easier for the 
user to type the desired infrequently occurring form of the infrequent word without needing to 
carefully type each letter of the word. Only one additional selection step is required when the 
uninfected form of the word is selected by contacting the associated row in the selection list in 
the region associated with this feature. 

15 Figure 4 J shows a preferred embodiment for processing word-choice list selections, 

registering contacts within regions 154, 157, or 160. Block 41010 identifies which row of the 
word-choice list was contacted and the associated word. Block 41020 identifies the word- 
choice list column that was contacted and the associated function F co i for that column. In the 
preferred embodiment shown in Figure IB, three different columns are defined: one to the left 

20 of column marker 1 70, one to the right of column marker 1 72, and one between the column 
markers 170 and 172. Decision block 41030 determines if the function F co i consists of 
replacing the input sequence with a new set of x/y locations of the word selected, which in the 
preferred embodiment shown in Figure IB, corresponds to an x/y location to the right of 
column marker 172. If so, block 41032 replaces the input sequence with the sequence of x/y 

25 locations corresponding to the characters of the selected word and at block 41034 a new word 
choice list is generated as shown in Figure 4B. If function F co i does not replace the input 
sequence, processing of the selected word from the word choice list continues. Block 41040 
adjusts the selected word's priorities. 

In accordance with another aspect of the invention, during use of the system by a user, 

30 the lexicon is automatically modified by a "Promotion Algorithm" which, each time a word is 
selected by the user, acts to "promote" that word within the lexicon by incrementally 
increasing the relative frequency associated with that word. In one preferred embodiment, the 
Promotion Algorithm increases the value of the frequency associated with the word selected by 
a relatively large increment, while decreasing the frequency value of those words passed over 

35 by a very small decrement. For a lexicon in which relative frequency information is indicated 
by the sequential order in which words appear in a list, promotions are made by moving the 
selected word upward by some fraction of its distance from the head of the list. The Promotion 
Algorithm is designed to tend to avoid moving the words most commonly used and the words 
very infrequently used very far from their original locations. In one preferred embodiment, 
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this is achieved by altering the fraction of the remaining distance by which a selected word is 
promoted depending on its current relative position in the overall list. For example, words in 
the middle range of the list are promoted by the largest fraction with each selection. Words 
intermediate between where the selected word started and finished in the lexicon promotion are 
5 effectively demoted by a value of one. Conservation of the "word list mass'' is maintained, so 
that the information regarding the relative frequency of the words in the list is maintained and 
updated without increasing the storage required for the list. 

In accordance with another aspect of the invention, the Promotion Algorithm operates 
both to increase the frequency of selected words, and where appropriate, to decrease the 

10 frequency of words that are not selected. For example, in a lexicon in which relative frequency 
information is indicated by the sequential order in which words appear in a list, a selected word 
which appears at position IDX in the list is moved to position (IDX/2). Correspondingly, 
words in the list at positions (IDX/2) down through (IDX+1) are moved down one position in 
the list. Words are demoted in the list when a sequence of contact points is processed and a 

1 5 word selection list is generated based on the calculated matching metric values, and one or 

more words appear in the list prior to the word selected by the user. Words that appear higher 
in the selection list but are not selected may be presumed to be assigned an inappropriately 
high frequency (i.e., they appear too high in the list). Such a word that initially appears at 
position IDX is demoted by, for example, moving it to position (IDX * 2 + 1). Thus, the more 

20 frequent a word is considered to be, the less it is demoted in the sense that it is moved by a 
smaller number of steps. 

In accordance with another aspect of the invention, the promotion and demotion 
processes may be triggered only in response to an action by the user, or may be performed 
differently depending on the user's input. For example, words that appear higher in a selection 

25 list than the word intended by the user are demoted only when the user selects the intended 
word by "clicking and dragging" the intended word to the foremost location within the word 
selection list, using a stylus or mouse. Alternatively, the selected word that is manually 
"dragged" to a higher position in the selection list may be promoted by a larger than normal 
factor. For example, the promoted word is moved from position IDX to position (IDX/3). 

30 Many such variations will be evident to one of ordinary skill in the art. 

Figure 4F shows a preferred embodiment for adjusting the prioritization of words when 
a word is picked from the word choice list. Decision block 4610 determines if the selected 
word choice item is the exact-type word, that string of characters whose x/y locations happens 
to correspond exactly to the sequence of contact points, which in a preferred embodiment is 

35 displayed in a distinct location in the word choice list, such as in Figure IB where the exact- 
type word 154 ("rwzt" in the example shown) is separated by a solid line from other words in 
the list. If the chosen word is not the exact-type word, such as items 157 or 160, at block 4620 
the chosen word is promoted (as in the preferred embodiment shown in Figure 4G), and at 
block 4630 each of the words that appeared in the word choice list ahead of the chosen word 
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are explicitly demoted (as in the preferred embodiment shown in Figure 4H, in contrast to the 
incidental demotion that may occurs to one or more words simply as a consequence of the 
promotion of another word). 

If at block 4610, the chosen word is determined to be the exact-type word, decision 
5 block 4640 identifies if that word is a new word that is not yet included in the vocabulary 
modules. If not, then at block 4650 promotes the chosen word is promoted. If the chosen 
exact-type word is not yet included in the vocabulary modules, block 4660 identifies the 
appropriate word-list to which the word is to be added. Decision block 4665 identifies if space 
is available within the appropriate word-list, And if not, at block 4670 the last, least likely 

10 word in the appropriate word-list is deleted to make room for the word to be added. At block 
4675 the new word is added as the least likely word in the appropriate word-list, then at block 
4680 the newly added word is promoted without explicitly demoting the other words that 
appeared in the word choice list. 

Figure 4G shows a preferred embodiment of the word promotion performed in blocks 

15 4620, 4650 and 4680. Block 4710 identifies the chosen word's position within its word-list, 

and assigns idx that position value. Block 4720 defines newjdx as one half of the value of idx, 
specifying a position in the list halfway from the current position to the head of the list (the 
position of the word deemed to be most likely to be selected). Block 4730 demotes all words 
in positions between idx and new_idx by one position, filling the word's old position at idx and 

20 making room for the word at new idx. Block 4740 then promotes the chosen word by 

inserting it back into the list at position newjdx. Note that this preferred method of promotion 
essentially has the effect of decrementing by 1 the idx multiplier calculated for the word at 
block 4355. 

Figure 4H shows a preferred embodiment of explicit word demotion performed in 
25 block 4635. Block 4810 identifies the explicitly demoted word's position within its word-list, 
and assigns idx that position value. Block 4820 defines newjdx as double the value of idx plus 
one. Decision block 4830 compares the value of newjdx to the total number of words in the 
word-list. If newjdx is greater than the total number of words in the word-list, Block 4835 
sets newjdx equal to the value of the number of words in the word-list, since a word can be 
30 demoted no further than the end of the list. Block 4840 promotes all words in positions 

between idx and newjdx by one position, filling the word's old position at idx and making 
room for the word at newjdx. Block 4850 then demotes the chosen word by inserting it back 
into the list at position newjdx. Note that this preferred method of demotion essentially has 
the effect of incrementing by 1 the idx multiplier calculated for the word at block 4355. 
35 Figures 5A through 5E are schematic views showing a sequence of character inputs as 

an illustrative example of entering a word on the preferred embodiment of a portable computer 
102 incorporating a reduced auto-correcting keyboard system 100 formed in accordance with 
the present invention as shown in Figures 1 A and IB. Portable computer 102 contains a 
reduced keyboard 105 implemented on a touch screen display 103, which is used to generate 
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text to be output to text display region 104. 

Figure 5 A shows the location 510 of the first keystroke of a sequence of keystrokes 
corresponding to the entry of the word "text." In response to the keystroke 501 , the auto- 
correcting keyboard system displays the word choice list region 150 super-imposed on top of 
5 the text region 104, showing a list of words and other interpretations corresponding to the 

keystroke. In this example, the coordinate location 510 of the keystroke is physically nearest 
coordinate location associated with the letter "r." The word choice list includes "R" 51 1 as the 
default choice, shown in the word choice list in the position nearest the auto-correcting 
keyboard region 106. Since the letter "r", when it occurs as a "word" that is only one letter in 

10 length, is found more frequently in upper case (such as when "R" appears as an initial included 
as part of a person's name), "R" is offered in the word choice list in upper case. This is in 
accordance with the aspect of the present invention wherein information regarding the 
capitalization of each word is stored along with the word in the vocabulary modules so that the 
word can be presented in a preferred form of capitalization without requiring the user to 

15 activate keys (such as a Shift key) to specify the capitalization of the word entered. The word 
choice list shows "are" 512 as the next most likely choice, in accordance with the aspect of the 
present invention wherein a word or symbol may be associated with an arbitrary sequence of 
one or more letters, such that when the associated sequence of letters is entered by the user, the 
word or symbol is offered as a choice in the word choice list. In this example, the word "are" 

20 is associated as a "macro" expansion of the single letter "r" which is pronounced the same in 
English. Similarly, the word choice list shows "®" 513 as the third most likely choice, where 
this symbol has been included in the vocabulary modules based on the logical association with 
the letter "r." The word choice list shows "a" 514 as the fourth most likely choice, where "a" 
is a very commonly occurring word of one letter, so that it appears as a candidate in the word 

25 choice list even though the coordinate location associated with the letter "a" is relatively 

distant from contact location 501. Above these choices is the exact type region displaying "r" 
515 as an option for selection, since the coordinate location associated with the letter "r" is 
closer than that of any other letter to coordinate location 510 of the keystroke. 

Figure 5B shows the location 520 of the next keystroke, nearest to the coordinate 

30 location associated with the letter "w." The word choice list includes "re" 521 as the default 
choice, "Re" 522 as the next most likely choice, "ra" 523 as the third most likely choice and 
"Rs" 524 as the fourth most likely choice. Above these choices is the exact type region 
displaying "rw" 525 that is an option for selection. 

Figure 5C shows the location 530 of the next keystroke, nearest to the coordinate 

35 location associated with the letter "z." The word choice list includes "tax" 531 as the default 
choice, "Rex" 532 as the next most likely choice, "fax" 533 as the third most likely choice and 
"was" 534 as the fourth most likely choice. Above these choices is the exact type region 
displaying "rwz" 535 that is an option for selection. 
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Figure 5D shows the location 540 of the next keystroke, very near the coordinate 
location associated with the letter "t." The word choice list includes "text" 541 as the default 
choice, "year" 542 as the next most likely choice, "rest" 543 as the third most likely choice and 
"fact" 544 as the fourth most likely choice. Above these choices is the exact type region 
displaying "rwzt" 545 that is an option for selection. The word "text" is to be entered as the 
next word. 

Figure 5E shows the next keystroke at 550, in the region designated as the "space" key. 
The space key is outside of the auto-correcting keyboard region 106, and thus can be 
unambiguously associated with a specific function. The space key acts to accept the default 
word "text" 541 and enters the word "text" 542 in the text output region 104 at the insertion 
point 107 in the text being generated where the cursor was last positioned. Simultaneously, the 
current input sequence is cleared, and the word choice list display is removed from the display 
screen 103 of the portable computer 102 so that the text output region 104 is unobscured. 

While the preferred embodiment of the invention has been illustrated and described, it 
will be appreciated that various changes can be made therein without departing from the spirit 
and scope of the invention. For example, those skilled in the art will appreciate that the 
keyboard 105 and its auto-correcting keyboard region 106 may be configured in various ways, 
and may have a varying number of explicit function keys 108 - 122. The auto-correction 
technique disclosed herein is equally applicable to keyboards of different sizes, and to 
traditional mechanical keyboards of various sizes as well as touch-panel and touch-screen 
based keyboards such as the one described in the preferred embodiment. The specific format 
of the word choice list 150, such as the number of word choices presented, the arrangement of 
the word choices, and the functions associated with different areas of the word choice list may 
be changed. For example, those skilled in the art will appreciate that the function that consists 
of replacing the input sequence with a new set of x/y locations of the word selected could be 
omitted in certain applications. Furthermore, the specific algorithms used in promoting and 
demoting words within the vocabulary modules could also be altered. For example, a selected 
word could be promoted by moving it l A of the distance to the head of its list rather than the 
factor of Vi used in the preferred embodiment described above. 
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1 . A text entry system comprising: 

(a) a user input device comprising a touch sensitive surface including an auto- 
correcting keyboard region comprising a plurality of the characters of an alphabet, wherein 
each of the plurality of characters corresponds to a location with known coordinates in the 

5 auto-correcting keyboard region, wherein each time a user contacts the user input device within 
the auto-correcting keyboard region, a location associated with the user contact is determined 
and the determined contact location is added to a current input sequence of contact locations; 

(b) a memory containing a plurality of objects, wherein each object is further 
associated with a frequency of use; 

1 0 (c) an output device with a text display area; and 

(d) a processor coupled to the user input device, memory, and output device, said 
processor comprising: 

(i) a distance value calculation component which, for each determined 
contact location in the input sequence of contacts, calculates a set of distance values between 

15 the contact locations and the known coordinate locations corresponding to one or a plurality of 
characters within the auto-correcting keyboard region; 

(ii) a word evaluation component which, for each generated input sequence, 
identifies one or a plurality of candidate objects in memory, and for each of the one or a 
plurality of identified candidate objects, evaluates each identified candidate object by 

20 calculating a matching metric based on the calculated distance values and the frequency of use 
associated with the object, and ranks the evaluated candidate objects based on the calculated 
matching metric values; and 

(iii) a selection component for identifying one or a plurality of candidate 
objects according to their evaluated ranking, presenting the identified objects to the user, and 

25 enabling the user to select one of the presented objects for output to the text display area on the 
output device. 

2. The text entry system of claim 1, wherein: 

(a) each of the plurality of objects in memory is further associated with one or a 
plurality of predefined groupings of objects; and 
30 (b) the word evaluation component, for each generated input sequence, limits the 

number of objects for which a matching metric is calculated by identifying one or a plurality of 
candidate groupings of the objects in memory, and for one or a plurality of objects associated 
with each of the one or a plurality of identified candidate groupings of objects, calculates a 
matching metric based on the calculated distance values and the frequency of use associated 
35 with each candidate object, and ranks the evaluated candidate objects based on the calculated 
matching metric values. 

3 . The system of claim 1, wherein the characters of the alphabet are arranged on 
the auto-correcting keyboard region in approximately a standard "QWERTY" layout. 

4. The system of claim 3, wherein the width to height ratio of the auto-correcting 
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keyboard region is approximately 2 to 1 . 

5. The system of claim 3, wherein the width to height ratio of the auto-correcting 
keyboard region is less than 2 to 1 . 

6. The system of claim 3, wherein one or a plurality of the characters arranged on 
5 the auto-correcting keyboard region are illegible. 

7. The system of claim 1, wherein said auto-correcting keyboard region includes 
one or a plurality of known locations associated with one or a plurality of punctuation 
characters, wherein said memory includes one or a plurality of objects in memory which 
include one or a plurality of the punctuation characters associated with locations in said auto- 

1 0 correcting keyboard region. 

8. The system of claim 1, wherein objects in memory are further associated with 
one or a plurality of modules, wherein each module comprises a set of objects with one or a 
plurality of common characteristics. 

9. The system of claim 8, wherein the text entry system comprises a module 
15 selector whereby a user can determine which modules are to be evaluated by the word 

evaluation component in order to identify candidate objects. 

10. The system of claim 8, wherein the plurality of modules comprises word stem 
modules and suffix modules, wherein each word stem module comprises a logical organization 
of uninfected word stem objects, and wherein each suffix module comprises a logical 

20 organization of suffixes which can be appended to word stems to form inflected words, 
whereby 

each word stem module is associated with one or a plurality of suffix modules, whereby 
whenever the word evaluation component calculates a matching metric value for a given word 
stem in a given word stem module with respect to an initial sequence of contacts within an 

25 input sequence such that the calculated matching metric value ranks higher than a 

predetermined threshold, the word evaluation component evaluates the remaining contacts of 
the input sequence with respect to the associated suffix modules, whereby whenever the word 
evaluation component calculates a matching metric value for a given suffix in one of said 
associated suffix modules that ranks higher than a second predetermined threshold, said suffix 

30 is appended to said word stem to form a completed word corresponding to a matching metric 
value that is a function of said determined word stem matching metric value and said 
determined suffix matching metric value. 

1 1 . The system of claim 1 , wherein the word evaluation component calculates the 
matching metric for each candidate object by summing the distance values calculated from 

35 each contact location in the input sequence to the location assigned to the character in the 

corresponding position of the candidate object, and applying a weighting function according to 
the frequency of use associated with the object. 

12. The system of claim 1 1, wherein each character of the alphabet associated with 
the auto-correcting keyboard region is assigned a Cartesian coordinate and wherein the 
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distance value calculation component calculates the distance between the contact location and 
the location corresponding to a character according to standard Cartesian coordinate distance 
analysis. 

13. The system of claim 1 1, wherein each character of the alphabet associated with 
5 the auto-correcting keyboard region is assigned a Cartesian coordinate and wherein the 

distance value calculation component calculates the distance between the contact location and 
the location corresponding to a character as the square of the standard Cartesian coordinate 
distance. 

14. The system of claim 1 1 , wherein the distance values are placed in a table. 

10 15. The system of claim 1 1, wherein each location on the auto-correcting keyboard 

region is defined by a horizontal and a vertical coordinate, and wherein the distance value 
between a contact location and the known coordinate location corresponding to a character 
comprises a horizontal and a vertical component, wherein the vertical component is adjusted 
by a weighting factor in calculating the distance of the contact location from the character. 

15 16. The system of claim 1 1 , wherein the frequency of use associated with each 

candidate object in memory comprises the ordinal ranking of the object with respect to other 
objects in memory, wherein an object associated with a higher relative frequency corresponds 
to a numerically lower ordinal ranking. 

17. The system of claim 16, wherein the frequency weighting function applied by 
20 the word evaluation component to the summed distance values for a candidate object 

comprises multiplying the sum of the distance values by the base 2 logarithm of the ordinal 
ranking of the object. 

18. The system of claim 11, wherein the word evaluation component adds an 
increment value to the sum of the distance values prior to applying a weighting function 

25 according to the frequency of use associated with the candidate object. 

19. The system of claim 1 8, wherein the increment value is a fixed value that is 
approximately twice the average distance between adjacent locations on the auto-correcting 
keyboard region corresponding to characters. 

20. The system of claim 2, wherein objects in memory are stored such that objects 
30 are classified into groupings comprising objects of the same length. 

21 . The system of claim 20, wherein the word evaluation component limits the 
number of objects for which a matching metric is calculated by initially identifying candidate 
groupings of objects of the same length as the number of inputs in the input sequence. 

22. The system of claim 21, wherein if fewer than a threshold number of candidate 
35 objects are evaluated to have a matching metric score better than a threshold value, the word 

evaluation component identifies candidate groupings of objects of progressively longer lengths 
and calculates the matching metric for the objects in the identified groupings until said 
threshold number of candidate objects are evaluated to have a matching metric score better 
than said threshold. 



WO 00/74240 



PCT/US00/14684 



23. The system of claim 22, wherein the word evaluation component calculates the 
matching metric for each candidate object by summing the distance values calculated from 
each contact location in the input sequence to the location assigned to the character in the 
corresponding position of the candidate object and adding an increment value, and applying to 

5 this sum a weighting function according to the frequency of use associated with the object, and 
wherein the increment value added to the sum of the distance values is a value that is based on 
the difference between the number of characters in the candidate object and the number of 
inputs in the current input sequence. 

24. The system of claim 2, wherein the word evaluation component calculates the 
10 matching metric for each candidate object by summing the distance values calculated from 

each contact location in the input sequence to the location assigned to the character in the 
corresponding position of the candidate object, and applying a weighting function according to 
the frequency of use associated with the object. 

25. The system of claim 24, wherein the frequency of use associated with each 
15 candidate object in memory comprises the ordinal ranking of the object with respect to other 

objects in one or a plurality of sub-groupings in memory with which said object is associated, 
wherein an object associated with a higher relative frequency corresponds to a numerically 
lower ordinal ranking. 

26. The system of claim 24, wherein for each calculated distance value between a 
20 contact location in the input sequence and the known coordinate location corresponding to a 

character within the auto-correcting keyboard region wherein said calculated distance exceeds 
a threshold distance value, for each object in memory in which said character occurs at a 
position in the sequence of the characters of said object corresponding to the position of said 
contact location in said input sequence, said object is ranked by the word evaluation 
25 component as an object that is excluded from presentation to the user for selection. 

27. The system of claim 26, wherein one or a plurality of the identified candidate 
groupings of the objects in memory comprise objects that are excluded from presentation to the 
user for selection, wherein at least one of the calculated distance values included in the 
calculated sum of distance values for each object in said one or identified candidate groupings 

30 of objects exceeds a threshold distance value. 

28. The system of claim 27, wherein the auto-correcting keyboard region is 
separated into two or more predefined clustering regions, each of which contains the known 
locations of one or a plurality of characters, and wherein each objects in memory is assigned to 
a predefined group according to which of said two or more predefined clustering regions 

35 contain the known locations corresponding to one or a plurality of the initial characters of said 
object. 

29. The system of claim 28, wherein the auto-correcting keyboard region is 
separated into three predefined clustering regions, and wherein each object in memory is 
assigned to one of nine predefined groupings based which of the three predefined clustering 
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regions contain the known locations corresponding to each of the first two characters of said 
object. 

30. The system of claim 1, wherein for each character corresponding to a known 
location in the auto-correcting keyboard region, a region is predefined around one or a plurality 

5 of said known locations wherein the distance between an input contact location falling within 
said predefined region and the known character location within said predefined region is 
calculated as a distance of zero. 

31. The system of claim 30, wherein the relative sizes of said predefined regions 
correspond to the relative frequencies of occurrence of the characters associated with the 

10 known locations within said predefined regions. 

32. The system of claim 30, wherein said predefined region around the known 
location of a character corresponds to a displayed key on the touch screen. 

33. The system of claim 1, wherein at least one of the locations with known 
coordinates in the auto-correcting keyboard region corresponds to a plurality of characters, one 

15 or a plurality of which include various diacritic marks, wherein the plurality of characters 

comprise variant forms of a single base character, and wherein objects in memory are stored 
with their correct accented characters. 

34. The system of claim 1, wherein the selection component presents the identified 
one or a plurality of candidate objects for selection by the user in a candidate object list in the 

20 text display area. 

35. The system of claim 34, wherein the selection component identifies the highest 
ranked candidate object and presents the identified object in the candidate object list in the 
position nearest to the auto-correcting keyboard region. 

36. The system of claim 1, wherein a user selection of a character that is associated 
25 with a contact outside of the auto-correcting keyboard region accepts and outputs the 

determined highest ranked candidate object at a text insertion point in the text display area 
prior to outputting the selected character at the text insertion point in the text display area. 

37. The system of claim 1, wherein user selection of an object for output at a text 
insertion point in the text display area terminates the current input sequence such that the next 

30 contact within the auto-correcting keyboard region starts a new input sequence. 

38. The system of claim 1, wherein the selection component detects a distinctive 
manner of selection that is used to select a candidate object, and wherein upon detecting that an 
object has been selected through said distinctive manner, the system replaces the current input 
sequence of actual contact locations with an input sequence of contact locations corresponding 

35 to the coordinate locations of the characters comprising the selected object, and wherein a next 
contact in the auto-correcting keyboard region is appended to the current input sequence. 

39. The system of claim 1, wherein the word evaluation component determines, for 
each determined contact location in each input sequence of contact locations, the closest 
known location corresponding to a character, and constructs an exact typing object composed 
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of said determined corresponding characters in the order corresponding to the input sequence 
of contact locations. 

40. The system of claim 39, wherein for each input sequence of contact locations, 
the selection component presents said exact typing object to the user for selection. 
5 41 . The system of claim 40, wherein when the user selects said exact typing object 

for output to the text display area on the output device and said exact typing object is not 
already included as one of the objects in memory, said exact typing object is added to the 
memory. 

42. The system of claim 40, wherein prior to displaying said exact typing object to 
10 the user for selection, the selection component compares the exact typing object to a database 

of offensive objects, each of which is associated with an acceptable alternative object for 
display, and if a match is found, replaces the exact typing object with the associated acceptable 
object for presentation to the user. 

43. The system of claim 1, wherein the selection component identifies the highest 
15 ranked candidate object and presents the identified object at the text insertion point in the text 

display area on the output device. 

44. The system of claim 43, wherein the text entry system includes a select key 
region associated with an object selection function, wherein when said select key region is 
contacted, the object presented at the text insertion point in the text display area on the output 

20 device is replaced with next highest ranked object of the identified one or a plurality of 
candidate objects. 

45. The system of claim 1, wherein the text entry system includes a delete key 
region associated with a delete function, wherein when the current input sequence includes at 
least one contact and said delete key region is contacted, the last input contact from the current 

25 input sequence of contacts is deleted, without terminating the current input sequence. 

46. The system of claim 1, wherein the text entry system includes an Edit Word key 
region associated with an Edit Word function, wherein when no current input sequence exists 
and said Edit Word key region is contacted, 

when the text insertion point in the text display area on the output device is contained 
30 within a previously output word, the system establishes a new current input sequence 
consisting of a sequence of contact locations corresponding to the coordinate locations 
associated with the characters of said word, and 

when the text insertion point in the text display area on the output device is located 
between two previously output words, the system establishes a new current input sequence 
35 consisting of a sequence of contact locations corresponding to the coordinate locations 
associated with the characters of the word adjacent to the text insertion point, and 

wherein the text entry system processes said new current input sequence and 
determines a corresponding ranking of new candidate objects, and 

wherein selection of one of the new candidate objects replaces the previously output 
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word used to establish said new current input sequence. 

47. The system of claim 1, wherein as the user enters an input sequence by 
performing a sequence of contact actions within the auto-correcting keyboard region, the 
processor determines the location associated with each user contact action by recording each 

5 contact action in the sequence as an indexed primary set of a fixed number of two or more 
regularly spaced contact points along the path traced out by the user contact action, and by 
assembling two or more corresponding secondary sets of contact points by taking, for each of 
the two or more possible primary index values, the sequence of contact points having the same 
index value, one from each recorded indexed primary set of contact points, and by determining 

1 0 with respect to each word that is selected by the user for output, a minimizing primary index 
value that identifies the assembled secondary set of contact points for which the calculated 
distance between the assembled secondary set of contact points and the known locations 
corresponding to the characters of the selected word is minimized, and whereby for a next 
input sequence of user contact actions, the distance value calculation component calculates 

15 distance values based on a sequence of contact locations determined as the secondary set of 
contact point locations assembled from said next input sequence of contact actions 
corresponding to the determined minimizing primary index value. 

48. The system of claim 47, wherein for a plurality of user input sequences, the 
distance value calculation component computes a running average of the distance calculations 

20 for each of the two or more assembled secondary sets corresponding to the two or more 

primary index values, and whereby for a next input sequence of contact actions, the distance 
value calculation component calculates distance values based on a sequence of contact 
locations determined as the secondary set of contact point locations assembled from said next 
input sequence of contact actions corresponding to the minimizing primary index value 

25 determined with respect to said computed running averages. 

49. The system of claim 48, wherein for each primary index value, the distance 
value calculation component computes a running average of the horizontal and vertical 
components of the offset of the coordinate location corresponding to each character of each 
selected word with respect to the coordinate location of each corresponding recorded indexed 

30 contact point, and wherein in performing distance calculations for the word evaluation 
component, the distance value calculation component adjusts the horizontal and vertical 
coordinates of each recorded indexed contact point by an amount that is a function of the 
average horizontal and vertical offsets computed with respect to the corresponding primary 
index value. 

35 50. The system of claim 1, wherein for each input contact location, the distance 

value calculation component computes a running average of the horizontal and vertical 
components of the offset of the coordinate location corresponding to each character of each 
selected word with respect to the coordinates of each corresponding input contact location, and 
wherein in performing distance calculations for the word evaluation component, the distance 
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value calculation component adjusts the horizontal and vertical coordinates of each input 
contact location by amounts that are functions of the computed average signed horizontal and 
vertical offsets. 

5 1 . The system of claim 1, wherein the processor further a stroke recognition 

5 component that determines for each user contact action within the auto-correcting keyboard 
region whether the point of contact is moved less than a threshold distance from the initial 
contact location prior to being lifted from the touch sensitive surface, and whereby 

when the point of contact is moved less than a threshold distance from the initial 
contact location prior to being lifted from the touch sensitive surface, the stroke recognition 
10 component determines that the user contact is a tap contact, and the location determined to be 
associated with the user contact is added to the current input sequence of contact locations to 
be processed by the distance value calculation component, the word evaluation component, and 
the selection component, and whereby 

when the point of contact is moved greater than or equal to a threshold distance from 
15 the initial contact location prior to being lifted from the touch sensitive surface, the stroke 

recognition component determines that the user contact is one of a plurality of stroke contacts 
that are associated with known system functions, and classifies the stroke contact as one of the 
plurality of predefined types of stroke contacts. 

52. The system of claim 1 , wherein when a threshold number of contact locations in 
20 the input sequence are further than a threshold maximum distance from the corresponding 

character in the sequence of characters comprising a given candidate object, said object is 
identified as no longer being a candidate object for the selection component. 

53. The system of claim 1, wherein the processor further comprises a frequency 
promotion component for adjusting the frequency of use associated with each object in 

25 memory as a function of the number of times the object is selected by the user for output to the 
text display area on the output device. 

54. The system of claim 53, wherein the frequency of use associated with each 
object in memory comprises the ordinal ranking of the object with respect to other objects in 
memory, wherein an object associated with a higher relative frequency corresponds to a 

30 numerically lower ordinal ranking, and wherein when an object is selected for output by the 
user, the frequency promotion component adjusts the ordinal ranking associated with said 
selected object by an amount that is a function of the ordinal ranking of said object prior to said 
adjustment. 

55. The system of claim 54, wherein the function used by the frequency promotion 
35 component to determine the amount by which the ordinal ranking associated with a selected 

object is adjusted reduces said amount for objects with ordinal rankings that are associated 
with relatively higher frequencies of use. 

56. The system of claim 53, wherein the frequency promotion component analyzes 
additional information files that are accessible to the text entry system to identify new objects 
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contained in said files that are not included among the objects already in said memory of said 
text entry system, and wherein said newly identified objects are added to the objects in 
memory as objects that are associated with a low frequency of use. 

57. The system of claim 56, wherein the frequency of use associated with a newly 

5 identified object that is added to the objects in memory is adjusted by the frequency promotion 
component as a function of the number of times that the newly identified object is detected 
during the analysis of said additional information files. 

58. The system of claim 2, wherein the processor further comprises a frequency 
promotion component for adjusting the frequency of use associated with each object in 

10 memory as a function of the number of times the object is selected by the user for output to the 
text display area on the output device. 

59. The system of claim 58, wherein when an object is selected by the user for 
output to the text display area on the output device, the frequency promotion component 
increases the value of the frequency associated with the selected object by a relatively large 

1 5 increment, and decreases by a relatively small decrement the frequency associated with 
unselected objects that are associated with the same grouping as the selected object. 

60. The system of claim 1 , wherein information regarding the capitalization of one 
or a plurality of objects is stored along with the objects in memory and wherein 

the selection component presents each identified object in a preferred form of capitalization 
20 according to the stored capitalization information. 

61. The system of claim 1 , wherein one or a plurality of objects in memory are 
associated with a secondary object in memory comprising a sequence of one or a plurality of 
letters or symbols, and wherein when the selection component identifies one of said objects for 
presentation to the user based on the matching metric calculated by the word evaluation 

25 component, the selection component presents the associated secondary object for selection. 

62. A text entry system comprising: 

(a) a user input device comprising a keyboard constructed with mechanical keys 
including an auto-correcting keyboard region comprising a plurality of keys, each 
corresponding to a character of an alphabet and each at a known coordinate location, wherein 

30 each time a user activates one or a plurality of adjacent keys in the auto-correcting keyboard 
region within a predetermined threshold period of time to generate a key activation event, a 
determined location corresponding to the key activation event is appended to a current input 
sequence of the determined locations of the key activation events; 

(b) a memory containing a plurality of objects, wherein each object is a string of 
35 one or a plurality of characters forming a word or a part of a word, wherein each object is 

further associated with a frequency of use; 

(c) an output device with a text display area; and 

(d) a processor coupled to the user input device, memory, and output device, said 
processor comprising: 
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(i) a distance value calculation component which, for each generated key 
activation event location in the input sequence of key activation events, calculates a set of 
distance values between the key activation event location and the known coordinate locations 
corresponding to one or a plurality of keys within the auto-correcting keyboard region; 
5 (ii) a word evaluation component which, for each generated input sequence, 

identifies one or a plurality of candidate objects in memory, and for each of the one or a 
plurality of identified candidate objects, evaluates each identified candidate object by 
calculating a matching metric based on the calculated distance values and the frequency of use 
associated with the object, and ranks the evaluated candidate objects based on the calculated 
10 matching metric values; and 

(iii) a selection component for identifying one or a plurality of candidate 
objects according to their evaluated ranking, presenting the identified objects to the user, and 
enabling the user to select one of the presented objects for output to the text display area on the 
output device. 

15 63. The text entry system of claim 62, wherein: 

(a) each of the plurality of objects in memory is further associated with one or a 
plurality of predefined groupings of objects; and 

(b) the word evaluation component, for each generated input sequence, limits the 
number of objects for which a matching metric is calculated by identifying one or a plurality of 

20 candidate groupings of the objects in memory, and for one or a plurality of objects associated 
with each of the one or a plurality of identified candidate groupings of objects, calculates a 
matching metric based on the calculated distance values and the frequency of use associated 
with each candidate object, and ranks the evaluated candidate objects based on the calculated 
matching metric values. 

25 64. The system of claim 62, wherein the keys associated with the characters of the 

alphabet are arranged in the auto-correcting keyboard region in approximately a standard 
"QWERTY" layout. 

65. The system of claim 62, wherein when a key activation event is detected 
comprising the simultaneous activation of a plurality of adjacent keys in the auto-correcting 

30 keyboard region, a location corresponding to said key activation event is determined as a 

function of the locations of the simultaneously activated keys, and said determined location is 
appended to the current input sequence of the locations of the key activation events. 

66. The system of claim 65, wherein the function used to determine the location of 
said key activation event comprises the computation of the location corresponding to the center 

35 of the locations of the simultaneously activated keys. 

67. The system of claim 66, wherein the function used to determine the location of 
said key activation event comprises the computation of the location corresponding to the 
weighted center of gravity of the locations of the simultaneously activated keys, wherein the 
weights associated with each of the keys in the auto-correcting keyboard region correspond to 
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the relative frequencies of occurrence of the characters associated with the keys, wherein said 
relative frequencies are determined with respect to the frequencies of occurrence of the 
characters in the objects in memory. 

68. The system of claim 62, wherein when a key activation event is detected 

5 comprising the activation of a plurality of adjacent keys in the auto-correcting keyboard region 
within a predetermined threshold period of time, wherein at all times during said key activation 
event at least one of said plurality of adjacent keys is activated and wherein at any moment 
during said key activation event that any subset of said plurality of keys is simultaneously 
activated, said simultaneously activated subset of keys comprises keys that are contiguously 
10 adjacent, a location corresponding to said key activation event is determined as a function of 
the locations of the entire plurality of adjacent keys detected during said key activation event, 
and said determined location is appended to the current input sequence of the locations of the 
key activation events. 

69. The system of claim 68, wherein the function used to determine the location of 

1 5 said key activation event comprises the computation of the location corresponding to the center 
of the locations of the simultaneously activated keys. 

70. The system of claim 69, wherein the function used to determine the location of 
said key activation event comprises the computation of the location corresponding to the 
weighted center of gravity of the locations of the simultaneously activated keys, wherein the 

20 weights associated with each of the keys in the auto-correcting keyboard region correspond to 
the relative frequencies of occurrence of the characters associated with the keys, wherein said 
relative frequencies are determined with respect to the frequencies of occurrence of the 
characters in the objects in memory. 

71. The system of claim 62, wherein said auto-correcting keyboard region includes 
25 one or a plurality of keys associated with one or a plurality of punctuation characters, wherein 

said memory includes one or a plurality of objects in memory which include one or a plurality 
of the punctuation characters associated with keys in said auto-correcting keyboard region. 

72. The system of claim 62, wherein the word evaluation component calculates the 
matching metric for each candidate object by summing the distance values calculated from 

30 determined location in the input sequence to the known location of the key corresponding to 
the character in the corresponding position of the candidate object, and applying a weighting 
function according to the frequency of use associated with the object. 

73. The system of claim 62, wherein at least one of the keys in the auto-correcting 
keyboard region corresponds to a plurality of characters, one or a plurality of which include 

35 various diacritic marks, wherein the plurality of characters comprise variant forms of a single 
base character, and wherein objects in memory are stored with their correct accented 
characters. 

74. The system of claim 62, wherein the selection component presents the identified 
one or a plurality of candidate objects for selection by the user in a candidate object list in the 
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text display area. 

75. The system of claim 74, wherein the selection component identifies the highest 
ranked candidate object and presents the identified object in the candidate object list in the 
position nearest to the auto-correcting keyboard region. 
5 76. The system of claim 74, wherein activation of a key that is associated with a 

character, wherein said key is not included within the auto-correcting keyboard region, accepts 
and outputs the determined highest ranked candidate object at a text insertion point in the text 
display area prior to outputting the selected character at the text insertion point in the text 
display area. 

10 77. The system of claim 74, wherein user selection of an object for output at a text 

insertion point in the text display area terminates the current input sequence such that the next 
key activation event within the auto-correcting keyboard region starts a new input sequence. 
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